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Methods of Eliciting Broadly Neutralizing Antibodies 
Targeting HIV-1 gp41 



Background of the Invention 



Statement as to Rights to Inventions 

Federally -Sponsored Research and Development 

Part of the work performed during development of this invention utilized U.S. 
Government funds. The U.S. Government has certain rights in this invention pursuant to 
INNOVATION Grant No. R21 AI 42714. 

Field of the Invention 

The present invention is related to HIV therapy and prophylaxis. In particular, the 
invention relates to methods for eliciting broadly neutralizing antibodies that target entry- 
relevant structures of HIV-1 gp41. Such methods, and pharmaceutical compositions 
therefor, can be employed to inhibit HIV entry into uninfected cells. 

Related Art 

The development of effective vaccines to prevent infection with HIV remains a 
high priority goal. To date, envelope glycoproteins (gp!60 and gpl20/gp41) have been 
the main focus of vaccine research efforts. One result of this work is the observation that 
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the humoral response generated against native forms of the envelope (primarily oligomeric 
forms of the gpl20/gp41 complex) is more broadly neutralizing than antibody raised 
against denatured and/or monomeric envelope (VanCott, T. C, et aL, J. Virol 71:43 19- 
4330 (1997)). Structural considerations are important components for both understanding 
5 the immunogenicity of the envelope protein and the design of envelope based immunogens 
which induce a broad neutralizing response against HIV. 

A good deal of structural information is available with respect to the 
transmembrane protein (TM or gp41). Predictive work indicated that several regions of 
the ectodomain of gp41 display a high propensity to exhibit certain specific types of 

10 secondary structure (Gallaher, W. R., et aL, AIDS Res. Hum. Retroviruses 5:431-440 
(1989); Delwart, E. L., et aL, AIDS Res. Hum. Retroviruses 6:703-704 (1990)). 
Experimental work employing both synthetic peptides and protein recombinants has 
established that these predictions were generally correct and recently a three dimensional 
structure for a portion of the gp41 ectodomain was reported (Wild, C., era/;, Proc. Natl. 

15 Acad. Sci. USA 59:10537-10541 (1992); Wild, C, et aL, Proc. Natl. Acad. Sci. USA 
97:12676-12680 (1994); Wild, C, et aL, AIDS Res. Hum. Retroviruses 77:323-325 
(1995); Chan, D. C, et al, Cell 59:263-273 (1997)). Results from both solution studies 
and crystallographic analysis indicate that in one form this structured region of the 
transmembrane protein is a trimer of two interacting regions of gp41. This trimeric 

20 structure is a six helix bundle consisting of an interior parallel coiled-coil trimer (region 
one ) which associates with three identical a-helices (region two) which pack in an oblique, 
antiparallel manner into the hydrophobic grooves on the surface of the coiled-coil trimer 
(FIG. 3). This hydrophobic self-assembly domain is believed to constitute the core 
structure of gp41. 

25 A series of studies carried out using both synthetic peptides and recombinant 

proteins modeling the distal regions of the TM involved in generating this structure 
suggest that it (or the gp41 regions from which it is derived) plays a critical role in the 
process of HIV-1 entry (Wild, C, et aL, Proc. Natl Acad. Sci. USA 59:10537-10541 
(1992); Wild, C, et aL, AIDS Res. Hum. Retroviruses 9:1051-1053 (1993);Wild, C, et 

30 aL. Proc. Natl. Acad. Sci. USA 97: 12676-12680 (1994); Wild, C, etaL, AIDS Res. Hum 
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Retroviruses 77:323-325 (1995); Wild, C, et ai, Proc. Natl. Acad. Sci. USA 97:9770- 
9774 (1994); Chen, C. -H., et al, J. Virol 69:3771-3777 (1995)). 

The functional role of the transmembrane protein of HIV- 1 in virus replication was 
shown when the region of the ectodomain of the TM corresponding to amino acid 
5 residues 558-595, which was predictive of a-helical secondary structure (Gallaher, W. R., 
etal t AIDS Res. Hum. Retroviruses 5:431-440 (1989); Delwart, E. L., et al, AIDS Res. 
Hum. Retroviruses 6:703-704 (1990)), formed a coiled-coil structure when modeled as 
a synthetic peptide (Wild, C, etal, Proc. Natl Acad. ScL USA 59:10537-10541 (1992)). 
The peptide modeling this region, DP- 107, was shown to be a potent, virus specific 

10 inhibitor of HIV replication and the inhibitory activity was related to the structural 
components exhibited by the peptide. In both neutralization and cell-cell fusion assays, 
the DP- 107 peptide completely blocked virus infection at concentrations of 1.0 jig/ml. 
Unlike other inhibitors of HIV replication (i.e. soluble CD4) and most neutralizing sera, 
the activity of the DP-107 peptide was not isolate restricted. Using a series of DP-107 

15 analogs containing structure disrupting point mutations and a set of HTV-1 envelope 
constructs containing identical mutations, it has been shown that the structural 
components of the coiled-coil region of the TM were critical to both virus entry and 
fusion phenotype and that mutations which disrupted this gp41 structure gave rise to an 
envelope complex which was unable to mediate virus entry (Wild, C, et al, Proc. Natl 

20 Acad. ScL USA 97:12676-12680 (1994)). 

Studies of the coiled-coil domain of gp4 1 resulted in the identification of a second 
region of the ectodomain of the TM, which when modeled as a synthetic peptide, was also 
a potent, virus specific inhibitor of HIV replication (Wild, C, et al, AIDS Res. Hum. 
Retroviruses 9:1051-1053 (1993)). However, unlike the DP-107 region, the peptide 

25 corresponding to amino acid residues 643-678 of the TM (DP- 178), did not exhibit stable 
solution structure. Experiments with the DP-107 and DP- 178 peptides established that 
both of these materials blocked HIV replication at an early step, most likely during virus 
entry (Wild, C, et al, Proc. Natl Acad. Sci. USA 97:9770-9774 (1994)). This 
observation led to speculation that these peptides might inhibit virus replication by 

30 interacting with and disrupting determinants within the TM that were critical for virus 
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entry. Efforts to better define the higher order structural components that were present 
in gp41 and functioned during virus entry led to the observation that the distal regions of 
the TM modeled by the two inhibitory peptides (DP- 107 and DP- 178) did interact with 
one another to form an oligomeric structure (Wild, C, et al, AIDS Res. Hum. 
5 Retroviruses 77:323-325 (1995); Chen, C. -H., et al, J. Virol. 69:3111-3111 (1995)). 
Recently, this oligomeric structure was characterized as a trimeric, six helix bundle 
consisting of an interior parallel coiled-coil trimer (DP- 107 region) which associates with 
three identical a-helices (DP- 178 region) which pack into the hydrophobic grooves on the 
surface of the coiled-coil trimer (Figure 3) (Chan, D. C, etal, Cell 59:263-273 (1997)). 

10 Research has focused on determining the functional role of these gp41 structural 
determinants in virus entry. DP-107 and DP-178 peptides interact in a specific manner 
with the ectodomain of gp41 and this interaction is critical to their inhibitory activities. 
U.S. Patent No. 5,464,933, Bolognesi et al, describes peptides which exhibit 
potent anti-retroviral activity. Specifically disclosed are the peptide DP-178 (SEQ ID — 

15 NO:3) derived from the HTV-l^j gp41 protein, as well as fragments, analogs and 
homologs of DP-178. The peptides are used as direct inhibitors of human and non-human 
retroviral transmission to uninfected cells. The patent teaches that the peptides may also 
be prophylactically employed in individuals after such individuals have had an acute 
exposure to HIV. 

20 U.S . Patent No. 5,656,4 10, Wild et al , describes protein fragments derived from 

the HIV transmembrane glycoprotein (gp41), including the peptide DP-107 (SEQ. ID 
NO: 1 ) which have antiviral activity. Also disclosed are methods for inhibiting enveloped 
viral infection, and methods that modulate biochemical processes involving coiled coil 
peptide interactions. 

25 While recent work has increased knowledge of the structural components of the 

HTV-1 transmembrane protein, the immunogenic nature of gp41 remains poorly 
understood. It is known that one of two immunodominant regions present in the HTV-1 
envelope complex is located in gp41 (Xu, J. -Y., et al, J. Virol 55:4832-4838 (1991)). 
This determinant (TM residues 597-613) is associated with a strong, albeit 

30 non-neutralizing humoral response in a large number of HTV+ individuals. Also, the 
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broadly neutralizing antibody, 2F5, maps to the ectodomain of gp41 (TM residues 
662-667) (Muster, T. f et al, J- Virol (57:6642-6647 (1993); Muster, T., et al, J, Virol 
55:403 1-4034 (1994)). It is interesting to note that this antibody maps to a determinant 
of the TM that overlaps one of the two regions of gp41 which interact to form the 
5 recently characterized hydrophobic core of the protein (Figure 1). This observation has 
lead to speculation that 2F5 might actually neutralize virus by interacting with and 
disrupting the function of an entry-relevant gp41 structure. An extensive study which 
mapped the antigenic structure of gp41 supports this idea. This work characterized 
several conformation dependent gp4 1 M Abs which mapped to the same region of the TM 

10 as 2F5 (Earl, P. L., et al, J. Virol 77:2647-2684 (1997)). Although the binding sites for 
these non-neutralizing monoclonal antibodies (MAbs) overlapped the 2F5 determinant, 
in competition experiments neither of these antibodies was blocked from binding to native 
protein by the 2F5 MAb. This indicates that while the two dimensional regions to which 
these antibodies map are similar, the three dimensional epitopes to which they bind aire 

15 quite different 

The observation that only one neutralizing MAb (2F5) maps to the ectodomain of 
gp4 1 and that antibodies to the 2F5 epitope are poorly represented in sera from HIV 
infected individuals suggests that, for the most part, gp41 neutralizing epitopes are 
cryptic. The cryptic nature of these neutralizing epitopes is most likely related to the 

20 functional role of the TM in HIV-1 replication which involves mediating virus entry. It 
has been shown that prior to gpl20-CD4 binding the HIV envelope complex exists in a 
non-fusogenic form. While the exact nature of this pre-entry form is unknown, binding 
experiments have established that the non-fusogenic state is characterized by the 
inaccessibility of large portions of the gp41 ectodomain (Sattentau, Q. J. and J. P. Moore, 

25 J. Exp. Med. 174:407-415 (1991); Sattentau, Q. J., et al, Virol 206:713-717 (1995)). 

However, once binding of virus to target cell has occurred, the gpl20-gp41 complex 
undergoes a series of conformational changes that involve reorganization of both the 
extracellular surface component of the HIV-1 envelope protein (SU or gpl20) and TM 
proteins and the formation of structural components within the TM which are believed to 

30 be critical to virus entry. Although the steps involved in the transition from the 
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non-fusogenic to fusogenic state are largely unknown, it is believed that this 
transformation is characterized by the formation of a series of structural intermediates 
within the transmembrane protein which drive the conformational changes required for 
virus entry. The transitory nature of this event and the structures associated with it, rather 
5 than the absence of appropriate structural determinants, are believed to account for the 
poor neutralizing response to the TM component of the envelope system. 

Attention has been given to the development of vaccines for the treatment of HIV 
infection. The HIV-1 envelope proteins (gpl60, gpl20, gp41) have been shown to be the 
major antigens for anti-HIV antibodies present in AIDS patients (Barin, et al t Science 

10 228: 1094-1096 (1985)). Thus far, these proteins seem to be the most promising 
candidates to act as antigens for anti-HIV vaccine development. To this end, several 
groups have begun to use various portions of gp 1 60, gp 1 20, and/or gp4 1 as immunogenic 
targets for the host immune system. However, prior art attempts have thus far met with 
minimal success. _ 

1 5 Thus, although a great deal of effort is being directed to the design and testing of 

HIV vaccines, an effective vaccine is needed. 

Summary of the Invention 

An objective of the present invention is the induction and/or characterization of 
a humoral immune response targeting "entry-relevant" gp41 structures. In its broadest 

20 aspect, the present invention is directed to methods of raising a neutralizing antibody 
response to a broad spectrum of HIV strains and isolates. The present invention targets 
particular molecular conformations or structures that occur, or are exposed, following 
interaction of HTV with the cell surface during viral entry. Such a humoral response can 
be generated in vivo as a prophylactic or therapeutic measure in individuals to reduce or 

25 inhibit the ability of HIV to infect uninfected cells in the individual's body. Such a 
response can also be employed to raise antibodies against "entry relevant" gp4 1 structures. 
These antibodies can be subsequently employed for therapeutic uses, and as tools for 
further illuminating the mechanism of HIV cell entry. 
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One aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 
polypeptide comprising an amino acid sequence that is capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of gp41, 
(or the N-helical domain of gp4 1 ) . Peptides of this aspect of the invention are exemplified 
by P-15 and P-17 described herein. 

A second aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 
polypeptide comprising an amino acid sequence that corresponds to, or mimics, the 
transmembrane-proximal amphipathic oc-helical segment of gp41 (attheC-helical domain 
of gp4 1 ), or a portion thereof. Peptides of this aspect of the invention are exemplified by 
P-16 and P-18 described herein. 

A third aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a composition 
including one or more peptides or polypeptides which comprise amino acid sequences that 
are capable of forming solution stable structures that correspond to, or mimic, the gp41 
core six helix bundle. This bundle forms in gp41 by the interaction of the distal regions 
(N-helical domain and C-helical domain) of the transmembrane protein. See FIG. 1. This 
aspect of the invention is also directed to novel mixtures of peptides and polypeptides, 
including multimeric and conjugate structures, wherein said mixtures and structures form 
a stable core helix solution structure. A preferred embodiment of this aspect of the 
invention involves raising antibodies to a physical mixture of N-helical domain peptide and 
C-helical domain peptide, for example, P-17 and P-18, P-15 and P-16, P-17 and P-16, or 
P-15 and P-18. 

The present invention is also directed to a method of raising a broadly neutralizing 
antibody response to HIV by administering to a mammal a composition including one or 
more novel peptides and proteins, herein referred to as conjugates, that mimic fusion- 
active transmembrane protein structures. These conjugates are formed from two or more 
amino acid sequences that comprise: 
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(a) one or more amino acid sequences that are capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of 
gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
5 sequence of the transmembrane-proximal amphipathic a-helical segment of gp41 

(C-helical domain); 
wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
10 consisting of about 2 to about 25 amino acids. These conjugates are preferably 
recombinantly produced. An example of such a conjugate is described in Example 5. 

In a preferred embodiment of this aspect of the invention, one or more of these 
conjugates folds and assembles in solution into a structure corresponding to, or 

. - mimicking, the gp4 1 core six helix bundle. 

15 The present invention also relates to methods for forming peptides, multimers and 

conjugates of the invention. 

The present invention also relates to pharmaceutical compositions comprising the 
peptides, multimers and conjugates of the invention and a pharmaceutical acceptable 
carrier. 

20 The present invention also relates to polyclonal and monoclonal antibodies that are 

raised to the peptides, multimers and conjugates described in the preceding paragraphs. 

The present invention also relates to a method of administering a composition 
comprising polyclonal or monoclonal antibodies described above to an individual in an 
amount effective to reduce HIV infection of uninfected cells. 

25 The present invention also relates to a vaccine for providing a protective response 

in an animal comprising one or more peptides, multimers or conjugates of the present 
invention together with a pharmaceutically acceptable diluent, carrier, or excipient, 
wherein the vaccine may be administered in an amount effective to elicit an immune 
response in an animal to HIV. In a preferred embodiment, the animal is a mammal. In 

30 another preferred embodiment, the mammal is a human. 
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Brief Description of the Figures 

FIG. 1 illustrates the structural and antigenic regions of HIV-1 gp41. The 
extracellular, transmembrane and cytoplasmic domains are shown, as are the 
transmembrane-proximal amphipathic a-helical segment of gp41 (C-helical domain) and 
5 the heptad repeat region of gp41 (N-helical domain). 

FIG. 2 illustrates the formation of multimeric peptide constructs corresponding to 
the heptad repeat region of gp41 (represented by P-17) and one or more suitable linker 
peptides. 

FIG. 3 illustrates the construction of conjugates of the invention derived from 
10 repeating gp 41 fragments; and their subsequent folding and interaction to form 
immunologically relevant epitopes. 

FIG. 4 depicts the analysis of polyclonal sera to various immunogens by surface 
immunoprecipitation. The precipitations were performed in the presence (+) or absence 
(-) of 10 |jg/ml sCD4. 

1 5 FIG. 5 depicts analysis of polyclonal sera to various immunogens in neutralization 

assays. Immune sera or pre-immune (prebleed) sera were diluted 1:10 and incubated with 
various concentrations of virus (indicated in numbers of tissue culture infectious doses - 
TCDD50). Levels of virus replication were measured by the amount of p24 in the 
supernatant seven days following infection, and normalized to the degree of replication 

20 in the absence of any rabbit serum. The positive (+ve) control used is a strongly 
neutralizing serum from an HTV-1 infected individual. 

FIG. 6: Percent neutralization for gp233 and gp234 sera in different experimental 
formats. FIG. 6a shows the titration of bleed 2 for each animal against HIV-l^ in the 
cell killing assay which uses cell viability as a measure of virus neutralization. MT2 cells 

25 are added to a mixture of virus (sufficient to result in greater than 80% cell death at 5 days 
post infection) and sera which had been allowed to incubate for approximately 1 hr. After 
5 days in culture, cell viability is measured by vital dye metabolism. FIG. 6b shows the 
percent neutralization for each bleed at a 1 : 1 0 dilution against HTV - 1 MN in an assay format 
employing CEM targets and p24 endpoint. In this assay, sera are incubated with 200 
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TCE) 50 of virus for 1 hr prior to the addition of cells. On days 1, 3, and 5 media are 
changed. On day 7 culture supernatants are collected and analyzed for virus replication 
by p24 antigen levels. In each assay format, percent neutralization is determined by 
comparison of experimental wells with cell and cell/virus controls. 
5 FIG. 7 provides an example of a construct of the present invention (SEQ. ID 

NO:75) along with the corresponding nucleic acid sequence used for recombinant 
expression of the construct (SEQ. ID NO:76). 

Detailed Description of the Preferred Embodiments 

The transitory-nature of the HIV -entry event, and the structures associated with 

10 it, account for the seeming lack of neutralizing epitopes within gp41. These structural 
components, which form and function only during virus entry, and remain unexposed or 

are not pfesentln the "native" fusion-inactive envelope complexvconstitute a novel set of 

neutralizing epitopes within gp41. The present invention involves immunization with 
constructs mimicking these highly conserved, gp41 structures involved in virus entry to 

1 5 elicit the production of broadly neutralizing antibodies targeting these structures. Thus, 
this invention is the induction of a humoral immune response targeting these "entry 
relevant" gp41 structures. 

One aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a peptide or 

20 polypeptide comprising an amino acid sequence that is capable of forming astable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of gp41 
which is located in the N-helical domain as defined herein. Peptides, or multimers thereof, 
that comprise amino acid sequences which correspond to or mimic solution conformation 
of the heptad repeat region of gp41 can be employed in this aspect of the invention. The 

25 heptad repeat region of gp4 1 includes 4 heptad repeats. Preferably, the peptides comprise 
about 28 to 55 amino acids of the heptad repeat region of the extracellular domain of HIV 
gp41 (N-helical domain, (SEQ. ED NO:l)), or multimers thereof. The peptides can be 



BNSDOCID: <WO 004061 6A1_I_> 



WO 00/40616 



PCT/USOO/00456 



-11- 

administered as a small peptide, or conjugated to a larger carrier protein such as keyhole 
limpet hemocyanin (KLH), ovalbumin, bovine serum albumin (BSA) or tetanus toxoid. 

Alternatively, peptides forming a stable coiled-coil solution structure 
corresponding to or mimicking the heptad repeat region of gp4 1 can be employed to form 
5 polyclonal or monoclonal antibodies that can be subsequently administered as therapeutic 
or prophylactic agents. 

To determine whether a particular peptide or multimer will possess a stable 
trimeric coiled-coil solution structure corresponding to or mimicking the heptad repeat 
region of gp41, the peptide can be tested according to the methods described in Wild, C, 
10 etaL Proc. Natl Acad. Sci. USA 89: 10537-10541 (1992), fully incorporated by reference 
herein. 

Shown below is the sequence for residues of the HIV-l^ gp4 1 protein that form 
the N-helical domain of the protein: 
ARQLLSGrVQQQNNIXRAffi^ 
15 (SEQ. IDNO:l) 

Two examples of useful peptides include the peptide P- 1 7, which has the formula, 
from amino terminus to carboxy terminus, of: 

NH 2 -NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-COOH 

(SEQ ID NO:2); 

20 and the peptide P-15, which has the formula, from amino terminus to carboxy terminus, 

of: 

NH 2 -SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL-COOH 

(SEQ ID NO:3). 

These peptides are optionally coupled to a larger carrier protein, or optionally include a 
25 terminal protecting group at the N- and/or C- termini. Useful peptides further include 

peptides corresponding to P-17 or P-15 that include one or more, preferably 1 to 10 

conservative substitutions, as described below. A number of additional useful N-helical 

rcL'ion peptides are described in the section entitled "Peptides/' 

A second aspect of the present invention relates to a method of raising a broadly 
30 neutralizing antibody response to HIV by administering to a mammal a peptide or 

polypeptide comprising an amino acid sequence that corresponds to, or mimics, the 
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transmembrane-proximal amphipathic a-helical segment of gp41 (C-helical domain, (SEQ 
ED NO:4)), or a portion thereof. Useful peptides or polypeptides include an amino acid 
sequence that is capable of forming a core six helix bundle when mixed with a peptide 
corresponding to the heptad repeat region of gp41, such as the peptide P-17. Peptides 
can be tested for the ability to form a core six helix bundle employing the system and 
conditions described in Chan, D. C, etal, Cell 59:263-273 (1997); Lu, M., etai. Nature 
Struct. Biol. 2:1075-1082 (1995), fully incorporated by reference herein. 

Shown below is the amino acid sequence for residues of the HTV- 1 ^ gp4 1 protein 
that form the C-helical domain of the protein: 

WNNMTWMEWDREINNYTSLfflSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQDDNO:4) 
Preferred peptides or multimers thereof, that can be employed in this aspect of the 
invention comprise about 6 or more amino acids, preferably about 24-56 amino acids, of 
the extracellular C-helical domain ofHTVgp41. The peptides can be administered as a 
small peptide, or conjugated to a larger carrier protein such as keyhole limpet hemocyanin 
(KLH), ovalbumin, bovine serum albumin (BS A) or tetanus toxoid. This transmembrane- 
proximal amphipathic a-helical segment is exemplified by the peptides P-16 and P-18, 
described below. 

Alternatively, peptides or polypeptides comprising amino acid sequences that 
correspond to, or mimic, the transmembrane-proximal amphipathic a-helical segment of 
gp4 1 , or a portion thereof, can be employed to form polyclonal or monoclonal antibodies 
as therapeutic or prophylactic agents. 

Examples of useful peptides for this aspect of the invention include the peptide P- 
1 8 which corresponds to a portion of the transmembrane protein gp4 1 from the HIV- 1^ 
isolate, and has the 36 amino acid sequence (reading from amino to carboxy terminus): 

NH r YTSLIHSLlEESQNQQEKNEQELLELDKWASLWNW-COOH 

(SEQ ID NO:5); 

and the peptide P-16, which has the following amino acid sequence (reading from amino 

to carboxy terminus): 

NH r WMEWDREINNYTSLIHSLEESQNQQEKNEQELL-COOH 

(SEQ ID NO:6) 
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These peptides are optionally coupled to a larger carrier protein. Useful peptides further 
include peptides corresponding to P- 18 or P- 16 that include one or more, preferably 1 to 
10 conservative substitutions, as described below. In addition to the full-length P- 18, 36- 
mer and the full length P-16, the peptides of this aspect of the invention may include 
5 truncations of the P-18 and P-16, as long as the truncations is capable of forming a six 
helix bundle when mixed with P-17. A number of other useful peptides are described in 
the section entitled "Peptides," below. 

A third aspect of the present invention relates to a method of raising a broadly 
neutralizing antibody response to HIV by administering to a mammal a composition 

1 0 including one or more peptides or polypeptides which comprise amino acid sequences that 
are capable of forming solution stable structures that correspond to, or mimic, the gp41 
core six helix bundle. This bundle forms in gp41 by the interaction of the distal regions 
of the transmembrane protein, the heptad repeat region and the amphipathic a-helical 
region segment roughly corresponding to the N-helical domain and C-helical domain. See 

15 FIG. 1. The bundle structures that form in native virus are the result of a trimeric 
interaction between three copies each of the heptad repeat region and the transmembrane- 
proximal amphipathic a-helical segment. In the compositions of the present invention, 
peptide regions interact with one another to form a core six helix bundle. This aspect of 
the invention is also directed to novel mixtures of peptides and polypeptides, including 

20 multimeric and conjugate structures, wherein said structures form a stable core helix 
solution structure. 

This aspect of the invention can employ mixtures of (a) one or more peptides that 
comprise an amino acid sequence that corresponds to, or mimics, a stable coiled coil 
heptad repeat region of gp41; and (b) one or more peptides that comprise a region that 

25 corresponds to, or mimics, the transmembrane-proximal amphipathic a-helical segment 
of gp4 1 . These mixtures are optionally chemically or oxidatively cross-linked to provide 
additional immunogenic structures that may or may not be solution stable. In addition to 
physical mixtures, and conventional cross-linking, the peptides (a) and (b) can be 
conjugated together via suitable linking groups, preferably a peptide residue having at 

30 least 2, preferably 2 to 25, amino acid residues. Preferred linking groups are formed from 
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combinations of glycine and serine, or combinations of glycine and cysteine when further 
oxidative cross-linking is envisioned. 

A preferred embodiment of this aspect of the invention involves raising antibodies 
to physical mixtures of P-17 and P-18, P-15 and P-16, P-17 and P-16 or P-15 and P-18. 

The present invention is also directed to a method of raising a broadly neutralizing 
antibody response to HIV by administering to a mammal a composition including one or 
more novel peptides and proteins, herein referred to as conjugates, that mimic fusion- 
active transmembrane protein structures. These conjugates are formed from peptides and 
proteins that comprise: 

(a) one or more amino acid sequences of 28 or more amino acids that are capable 
of forming a stable coiled-coil solution structure corresponding to or mimicking 
the heptad repeat region of gp4 1 ; and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
se q Uence of the transmembrane-proximal amphipathic a-helical segment of .gp41;. 

wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a peptide bond (amide linkage) or by an amino acid linking sequence consisting of about 
2 to about 25 amino acids. These peptides and proteins are preferably recombinantly 
produced. 

In a preferred embodiment of this aspect of the invention, one or more of these 
conjugates folds and assembles into a structure corresponding to, or mimicking, the gp41 
core six helix bundle. 

Non-limiting examples of the novel constructs or conjugates that can be formed 

include: 

( 1 ) three tandem repeating units consisting of P- 17-linker-P- 1 8 
(P- 1 7-lmkcr-P-184inker-P-17-linke^^ 

(2) P-17-linker-P-18-linker-P-17, 

(3 ) P-18-linker-P-17-linker-P-18, 

(4) P-17-linker-P-17, 
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(5) three tandem repeating units consisting of P-l 5-linker-P- 16 
(P- 1 5-linker-P- 1 6-linker-P- 1 5-linker-P- 1 6-linker-P- 1 5-linker-P- 16), 

(6) P- 1 5-linker-P- 1 6-linker-P- 15, 

(7) P- 1 6-linker-P- 1 5-linker-P- 1 6, and 

(8) P-16-linker-P-15; 

wherein each linker is an amino acid sequence, which may be the same or different, of 
from about 2 to about 25, preferably 2 to about 16 amino acid residues. Preferred amino 
acid residues include glycine and serine, for example (GGGGS) X> (SEQ ID NO:7) wherein 
x is 1, 2, 3, 4, or 5, or glycine and cysteine, for example (GGC)y, where y is 1, 2, 3 4 or 
5. In any of the described constructs, P-15 and P-17 are interchangeable and P-16 and 
P- 18 are interchangeable. An example of such a construct (SEQ ID NO:77) is shown in 
FIG. 7, along with the corresponding nucleic acid sequence (SEQ ID NO:78) used for 
recombinant expression of the construct. 

Alternatively, polyclonal or monoclonal antibodies can be raised against the 
immunogenic mixtures and conjugates described in this aspect of the invention. Such 
antibodies can be employed as therapeutic or prophylactic agents. 

In preferred aspects of the invention, the methods can be employed to immunize 
an HTV- 1 -infected individual such that levels of HIV- 1 will be reduced in such individual. 
In another aspect, the methods can be employed to immunize a non-HTV-1 -infected 
individual so that, following a subsequent exposure to HIV-1 that would normally result 
in HIV-1 infection, the levels of HIV-1 will be non-detectable using current diagnostic 
tests. 

Immunogen Preparation 

Induction and interpretation of a humoral immune response directed against gp4 1 
structural epitopes requires both immunogen preparation and antibody characterization. 
Synthetic peptides and recombinant proteins can both be used to generate antigenic 
structures corresponding to gp41 fusion active domains. 
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In one aspect of the invention, target immunogens model the heptad repeat region 
delineated by the P-17 peptide (capable of forming a trimeric coiled-coil structure). In 
another aspect of the invention, target immunogens model the transmembrane-proximal 
. amphipathic a-helical segment delineated by the P- 1 8 peptide. This region in the absence 
5 of the coiled-coil core exhibits random coil solution structure. (Wild, C, etaL, Proc. Natl 
Acad. Sci. USA 59:10537-10541 (1992); Wild, C, et al. t AIDS Res. Hum. Retroviruses 
9:1051-1053 (1993); Wild, C, etal, Proc. Natl Acad. Scl USA 97:9770-9774 (1994)). 
In another aspect, combinations of these target immunogens are employed for raising 
antibodies. 

10 In another aspect of the invention the target immunogen is the six helix 
hydrophobic bundle. This bundle is formed by the specific association of these two distal 
regions of the ectodomain of gp41 (Chan, D. C, et al t Cell 59:263-273 (1997); Lu, 
et al, Nature Struct. Biol. 2:1075-1082 (1995)). These constructs will mimic entry 
- -■ determinants which form and function during HIV- 1 entry. 

1 5 Synthetic Methods of Immunogen Preparation 

Immunogens can be prepared by several different routes. The constructs can be 
generated from synthetic peptides. This involves preparing each sequence as a peptide 
monomer followed by post-synthetic modifications to generate the appropriate oligomeric 
structures. The peptides are synthesized by standard solid-phase methodology. To 

20 generate a trimeric coiled-coil structure, the P-17 peptide monomer is solubilized under 
conditions which favor oligomerization. These conditions include a 20 mM phosphate 
buffer, pH 4.5 and a peptide concentration of 100 fiM (Wild, C, et al, Proc. Natl Acad. 
ScL USA 89: 10537- 10541 (1992)). The structure which forms under these conditions can 
be optionally stabilized by chemical crosslinking, for example using gluteraldehyde. 

25 Alternatively, a protocol which makes use of intermolecular disulfide bond 

formation to stabilize the trimeric coiled-coil structure can be employed in order to avoid 
any disruptive effect the cross-linking process might have on the structural components 
of this construct. This approach uses the oxidation of appropriately positioned cysteine 
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residues within the peptide sequence to stabilize the oligomeric structure. This requires 
the addition of a short linker sequence to the N terminus of the P- 17 peptide. The trimeric 
coiled-coil structure which is formed by this approach will be stabilized by the interaction 
of the cysteine residues (FIG. 2). The trimer is separated from higher order oligomeric 
5 forms, as well as residual monomer, by size exclusion chromatography and characterized 
by analytical ultracentrifugation. These covalently stabilized coiled-coil oligomers serve 
as the core structure for preparation of a six helix bundle. 

To accomplish preparation of a six helix bundle, an excess of P-18 peptide is 
added to the purified core structure. After incubation the reaction mixture is subjected 

10 to a cross-linking procedure to stabilize the higher order products of the specific 
association of these two peptides. The desired material is isolated by size exclusion 
chromatography and characterized by analytical ultracentrifugation. The immunogen 
corresponding only to the P-18 peptide requires no specific post-synthetic modifications. 
Using this approach, three separate target constructs are generated rapidly and injarge 

15 amounts. 

Recombinant Methods of Immunogen Preparation 

Another method for preparing target immunogens involves the use of abacterial 
expression vector to generate recombinant gp41 fragments. The use of an expression 
vector to produce the peptides and polypeptides capable of forming the entry-relevant 

20 immunogens of the present invention adds a level of versatility to immunogen preparation. 

New and modified forms of the antigenic targets are contemplated as the structural 
determinants of HIV-1 entry are better understood. The recombinant approach readily 
accommodates these changes. Also, this method of preparation allows for the ready 
modification of the various constructs (i.e. the addition of T- or B-cell epitopes to the 

25 recombinant gp41 fragments to increase immunogenicity). In addition, a form of the six 
helix hydrophobic core structure is generated which will not require additional 
stabilization, since determining the antigenic nature of this structure is important. Finally, 
these recombinant constructs can be employed as a tool to provide valuable insights into 
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additional structural components which form and function in gp41 during the process of 
virus entry. 

Thus, as part of the invention, novel fusion polypeptides (conjugates) are also 
provided, as are vectors, host cells and recombinant methods for producing the same. The 
5 present invention provides isolated nucleic acid molecules comprising a polynucleotide 
encoding the conjugates of the invention. 

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing the 
recombinant vectors, as well as to methods of making such vectors and host cells and for 
10 using them for production of fusion polypeptides or peptides by recombinant techniques. 

The polynucleotides may be joined to a vector containing a selectable marker for 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as 
a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a 

virusrit may be packaged in vitro using an appropriate packaging cell line and then 

15 transduced into host cells. 

The DN A insert should be operatively linked to an appropriate promoter, such as 
that described herein. Other suitable promoters will be known to the skilled artisan. The 
expression constructs will further contain sites for transcription initiation, termination and, 
in the transcribed region, a ribosome binding site for translation. The coding portion of 
20 the mature transcripts expressed by the constructs will preferably include a translation 
initiating at the beginning and a termination codon (UAA, UGA or UAG) appropriately 
positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable 
marker. Such markers include dihydrofolate reductase or neomycin resistance for 
25 eukaryotic cell culture and tetracycline or ampicillin resistance genes for culturing in E. 
coli and other bacteria. Representative examples of appropriate hosts include, but are not 
limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; 
fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 
cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. 
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Appropriate culture mediums and conditions for the above-described host cells are known 
in the art. 

Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated 
5 transfection, electroporation, transduction, infection or other methods. Such methods are 

described in many standard laboratory manuals, such as Davis et al t Basic Methods In 
Molecular Biology (1986). 

The polypeptide may be expressed in a modified form, such as a fusion protein, 
and may include not only secretion signals, but also additional heterologous functional 

10 regions. For instance, a region of additional amino acids, particularly charged amino 
acids, may be added to the N- terminus of the polypeptide to improve stability and 
persistence in the host cell, during purification, or during subsequent handling and storage. 
Also, peptide moieties may be added to the polypeptide to facilitate purification. Such 
regions may be removed prior to final preparation of the polypeptide. The addition of 

1 5 peptide moieties to polypeptides to engender secretion or excretion, to improve stability 
and to facilitate purification, among others, are familiar and routine techniques in the art. 

The fusion protein can be recovered and purified from recombinant cell cultures 
by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 

20 hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. Depending upon the host 
employed in a recombinant production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. In addition, polypeptides of 

25 the invention may also include an initial modified methionine residue, in some cases as a 
result of host-mediated processes. 

A bacterial expression vector (kindly provided by Dr. Terrance Oas, Duke 
University) was developed specifically for the expression of small proteins. This plasmid, 
pTCLE-G2C, is based on pAED-4, a T7 expression vector. A modified TrpLE (Yansura, 

30 D. G., Methods Enzymol 755:161-166 (1990)) fusion peptide (provided by Dr. Peter 
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Kim) was inserted after the T7 promoter (Studier, F. W., et al, Methods Enzymol. 
7S5:60-89 (1990)). There is an in frame Nde I site at the end of the TrpLE peptide that 
encodes a methionine cyanogen bromide (CNBr) cleavage site. This vector was used in 
an earlier study to express a recombinant form of the P-17 peptide (Calderone, T. L., et 
5 al J. Mol Biol 262:401-412 (1996)) and has been modified to expresses the P-18 
peptide. 

To generate a six helix hydrophobic core structure, several combinations of the 
heptad repeat (for example, P-17 or P-15) region and the amphipathic a-helical (for 
example, P- 1 6 or P- 1 8) segment of gp4 1 are separated by a flexible linker of amino acid 
10 residues. For example, (GGGGS) 3 (SEQ ID NO:7) can be encoded into the vector. This 
is accomplished by standard PCR methods. The (GGGGS) 3 (SEQ ID NO:7) linker motif 
is encoded by a synthetic oligonucleotide which is ligated between the P-17 and P-18 
encoding regions of the expression vector. 

" All cbnstmctrohs : ^e 'characterized by multiple restriction e 

1 5 sequencing. The success of this approach to attain multicomponent interactions has been 
recently demonstrated (Huang, B., et al, J. Immunol 255:216-225 (1997)). 

Examples of the novel constructs or conjugates that can be formed by the method 

are described above. 

Based on the parallel orientation of the subunits of the coiled coil core and the 

20 antiparallel orientation of the amphipathic a-helical segment in the six helix bundle, these 
constructs fold to generate the desired structures (See, FIG. 3.). Following expression, 
the recombinant gp41 fragments are isolated as inclusion bodies, cleaved from the leader 
sequence by cyanogen bromide, and separated from the leader by-product by size 
exclusion chromatography step (SUPERDEX 75). This protocol has been successfully 

25 used in the purification of large quantities of a modified form of the P-17 peptide 
(Calderone, T. L., etal, J. Mol Biol 262:407-412 (1996)). Recombinant constructs (2) 
and (3) are mixed in equalmolar quantities under non-denaturing conditions to generate 
a six-helix hydrophobic core structure. Constructs (1) and (4) will fold either intra- or 
intermolecularly to generate the same or similar structures (see FIG. 3 for the folding 

30 process). The desired product is purified by size exclusion chromatography on a 
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SUPERDEX 75 FPLC column and characterized by molecular weight under using a 
Beckman Model XL-A analytical ultracentrifuge. 

Definitions 

The phrase "entry-relevant" as employed herein, refers to particular molecular 
5 conformations or structures that occur or are exposed following interaction of HIV with 
the cell surface during viral entry, and the role of particular amino acid sequences and 
molecular conformations or structures in viral entry. 

The term "neutralizing" as employed herein refers to the ability to inhibit entry of 
HIV into cells, including an amount of inhibition that is useful for reducing or preventing 
10 infection of uninfected cells by the virus. 

The term "HIV" as used herein refers to all strains and isolates of human 
immunodeficiency virus type 1 . The constructs of the invention were based upon HIV- 1 
gp41, and the numbering of amino acids in HIV proteins and fragments thereof given 
herein is with respect to the HTV-1^ isolate. However, it is to be understood, that while 
15 HTV- 1 viral infection and the effects of the present invention on such HIV- 1 infection are 
being used herein as a model system, the entry mechanism that is being targeted is relevant 
to all strains and isolates of HTV-1. Hence the invention is directed to "broadly 
neutralizing" methods. 

The phrase "heptad repeat" or "heptad repeat region" as employed herein, refers 
20 to a common protein motif having a 4-3 repeat of amino acids, commonly leucine and/or 
isoleucine, and is often associated with alpha-helical secondary structure. The 'heptad 
repeat" can be represented by the following sequence: 

-AA, - AA 2 - AA 3 - AA 4 -AA 5 - AA 6 -AA 7 - 

where AA, and AA 4 are each one of leucine or isoleucine; while AA 2 , AA 3 , AA 5 , AA 6 , and 
25 AA 7 can be any amino acid. See, Wild, C, et al t Proc. Natl Acad. ScL USA 89: 10537- 
10541 (1992). 
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Peptides are defined herein as organic compounds comprising two or more amino 
acids covalently joined by peptide bonds. Peptides may be referred to with respect to the 
number of constituent amino acids, i.e., a dipeptide contains two amino acid residues, a 
tripeptide contains three, etc. Peptides containing ten or fewer amino acids may be 
referred to as oligopeptides, while those with more than ten amino acid residues are 
polypeptides. 

Peptides 

The complete gp41 amino acid sequence (HIV-1 Group M: Subtype B Isolate: 
LAI, N to C termini) is: 

AVGIGALFLGFLGAAGSTMGARSMTLTVQARQLLSGIVQQQNNLLEIAIEA 



WSNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQQEK 

NEQELLELDKWASLWNWF^NWLWYIK^ 
NRVRQGYSPLSFQTHLP-TPRG-PDRPEGIEEEGGERDRDRSIRLVNGSL 
ALIWDDLRSLCLFSYHRLRDLLLrVTRIVELLGRRGWEALKYWW 
NLLQYWS QELKNS AVSLLN ATA1AV AEGTDRVIEV VQG ACRAJRHff RRIRQG 
LERILL. (SEQK>NO:8) 

N-terminal helix region: 
ARQLLSGIVQQQNNIXRAEAQQHIIX^ 

(SEQIDNOil) 

Shown below is the sequence for residues 558-595 (SEQ ID NO:7) of the 
HIV-1^, gp41 protein in the N-helical domain of the protein. The a and d subscripts 
denote the 4-3 positions of the heptad repeat. 

NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 
25 d ad ad ad ad a 

571 578 585 (SEQE>NO:2) 
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C-terminal helix region: 

WNNMWMEWDREINNYTSLfflSLffiESQNQQEKNEQELLELDKWASLWNWF 
NTTNW (SEQ ID NO:4) 



Shown below is the amino acid sequence for residues 643-678 of the HIV-1^ 
gp41 protein in the C-helical domain of the protein. 

YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
d ad ad a d ad a 

647 654 661 (SEQ ID NO:5) 

Unlike the N-helix, when modeled as a peptide, the C-helical region of gp4 1 is not 
structured. However, when mixed with the N-peptide, the C-peptide does^takes on 
a -helical structure as part of the core structure complex. The structure forms in vitro on 
mixing the peptides and can be characterized spectrophotometrically (Lu, M., etal. 9 Nat 
Struct Biol. 2: 1 075- 1 082 ( 1 995)) . The initial determination of the effect of the mutations 
on C-helix structure may be performed by analyzing the ability of the mutant C-peptide 
to interact with the N-peptide and form the six-helix bundle. This analysis may be carried 
out using circular dichroism. N-helical and C-helical domain peptides can be constructed 
from multiple strains of HIV, and can include deletions, insertions and substitutions that 
do not destroy the ability of the resulting peptide to elicit antibodies when employed alone 
or in combination with other peptides of the invention. 

Examples of N-helical Domain Peptide Sequences (All sequences are listed from 
N-terminus to C-terminus.) from different HIV strains include, but are not limited to the 
following peptides: 



HIV-1 Group M: Subtype B Isolate: LAI 
ARQU^GIVQQQNNIlitAEAQQ 

(SEQ ID NO: 1) 
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SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 

(SEQ ID NO:9) 

P15 SGIVQQQNNLLRAffiAQQHLLQLTVWGIKQLQARIL (SEQIDNO:3) 
P-17 NNLLRAIEAQQHLLQLTVWGIKQLQARILAVERYLKDQ 
5 (SEQ ID NO:2) 

Subtype B Isolate: ADA 

SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVLALERYLRDQ 

(SEQ ID NO: 10) 

10 SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO: 1 1) 

NNLLRAIEAQQHLLQLTVWGIKQLQARVLALERYLRDQ (SEQ ID NO: 12) 

Subtype B Isolate: JRFL 

SGIVQQQNNLLRAIEAQQRMLQLTVWGIKQLQARVLAVERYLGDQ 

(SEQ ID NO: 13) 

15 SGIVQQQNNLLRAIEAQQRMLQLTVWGIKQLQARVL (SEQ ID NO: 14) 

NNLLRAIEAQQRMLQLTVWGIKQLQARVLAVERYLGDQ (SEQ ID NO:15) 

Subtype B Isolate: 89.6 

SGIVQQQNNLLRAJEAQQHMLQLTVWGIKQLQARVLALERYLRDQ 

(SEQ ID NO: 16) 

20 SGIVQQQNNLLRAIEAQQHMLQLTVWGIKQLQARVL (SEQ ID NO: 17) 

NNLLRAIE AQQHMLQLTVWGDCQLQARVLALERYLRDQ (SEQ ID NO: 18) 

Subtype C Isolate: BU9 1 08 12 

SGIVQQQSNLLRAIEAQQHMLQLTVWGIKQLQARVLAIERYLRDQ 

(SEQ ID NO: 19) 

25 SGIVQQQSNLLRAEAQQHMLQLTVWGIKQLQARVL (SEQIDNO:20) 
S NLLR AIE AQQHMLQLTVWGIKQLQ AR VLAIER YLRDQ (SEQ ID NO:21) 
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Subtype D Isolate: 92UG024D 

SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVLAVESYLKDQ 

(SEQ ID NO:22) 

SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO: 1 1 ) 

5 NNLLRAIEAQQHLLQLTVWGIKQLQARVLAVESYLKDQ (SEQIDNO:23) 

Subtype F Isolate: BZ163A 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLQDQ 

(SEQ ID NO:24) 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO:25) 

10 SNLLRAIEAQQHLLQLTVWGIKQLQARVLAVERYLQDQ (SEQ ID NO:26) 

Subtype G Isolate: FI.HH8793 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVLAIJERYLRDQ 

(SEQ ID NO:27) 

SGIVQQQSNLLRAIEAQQHLLQLTVWGIKQLQARVL (SEQ ID NO:25) 

15 SNLLRAIEAQQHLLQLTVWGIKQLQARVLALERYLRDQ (SEQ ID NO:28) 

Subtype H Isolate: BE.VI997 

SGIVQQQSNLLRAIQAQQHMLQLTVWGVKQLQARVLAVERYLKDQ 

(SEQ ID NO:29) 

SGIVQQQSNLLRAIQAQQHMLQLTVWGVKQLQARVL (SEQ ID NO:30) 

20 SNLLRAIQAQQHMLQLTVWGVKQLQARVLAVERYLKDQ (SEQIDNO:31) 

Subtype J Isolate: SE.SE92809 

SGIVQQQSNLLKAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQ 

(SEQ ID NO:32) 

SGIVQQQSNLLKAIEAQQHLLKLTVWGIKQLQARVL (SEQ ID NO:33) 

25 SNLLKAIEAQQHLLKLTVWGIKQLQARVLAVERYLKDQ (SEQ ID NO:34) 
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Group N Isolate: CM.YBF30 

SGIVQQQNILLRAEAQQHLLQLSIWGIKQLQAKVLAIERYLRDQ 

(SEQIDNO:35) 

SGIVQQQNILLRAIEAQQHLLQLSIWGIKQLQAKVL (SEQ ID NO:36) 

5 NILLRAEAQQHLLQLSIWGIKQLQAKVLAIERYLRDQ (SEQ ID NO:37) 

Group O Isolate: CM.ANT70C 

KGP/QQQDNLLRAIQAQQQLLRLSxWGIRQLRARLLALETLLQNQ 

(SEQ ID NO:38) 

KGIVQQQDNLLRAIQAQQQLLRLSxWGIRQLRARL (SEQ ID NO:39) 

10 DNLLRAIQAQQQLLRLSxWGIRQLRARLLALETLLQNQ (SEQ ID NO:40) 

Examples of C-helical Domain Peptide Sequences (All sequences are listed from 
N-terminus to C-terminus.) from different HTV strains include, but are not limited to the 
following peptides: 

HTV- 1 Group M: Subtype B Isolate: LAI 
15 WNNMTW^WDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 
NITNW (SEQIDNO:4) 
WMEWDREINNYTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:41) 

P 1 6 WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL (SEQ ID NO:6) 

20 P-18 YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:5) 

Subtype B Isolate: ADA 

W M E \V E REIEN YTGIJYTLIEES QNQQEKNEQDLLALDKWASLWNWF 

(SEQ ID NO:42) 

25 WMEWEREIENYTGLIYTLIEESQNQQEKNEQDLL (SEQIDNO:43) 
YTGLHTLIEESQNQQEKNEQDU^ALDKWASLWNWF (SEQ ID NO:44) 
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Subtype B Isolate: JRFL 

WMEWEREIDNYTSEIYTLffiESQNQQEKNEQELLELDKWASLWNWF 

(SEQ ID NO:45) 

WMEWEREIDNYTSEIYTLIEESQNQQEKNEQELL (SEQ ID NO:46) 

YTSEIYTLIEES QNQQEKNEQELLELDKW AS LWNWF (SEQ ID NO:47) 

Subtype B Isolate: 89.6 

WMEWEREIDNYTDYIYDLLEKSQTQQEKNEKELLELDKWASLWNWF 

(SEQ ID NO:48) 

WMEWEREIDN YTD YIYDLLEKS QTQQEKNEKELL (SEQ ED NO:49) 

YTD YIYDLLEKS QTQQEKNEKELLELDKW ASLWNWF (SEQ ID NO:50) 

Subtype C Isolate: BU910812 

WIQWDREISNYTGUYRLLEESQNQQENNEKDLLALDKWQNLWSWF 

(SEQIDNO:51) 

WIQWDREISNYTGEYRLLEESQNQQENNEKDLL (SEQ ID NO:52) 

YTGHYRLLEES QNQQENNEKDLLALDKWQNLWS WF (SEQ ID NO:53) 

Subtype D Isolate: 92UG024D 

WMEWEREISN YTGLIYDLIEES QIQQEKNEKDLLELDKWASLWNWF 

(SEQ ID NO:54) 

WME WEREIS N YTGLIYDLIEES QIQQEKNEKDLL (SEQ ED NO:55) 

YTGLIYDLIEES QIQ QEKNEKDLLELDK W AS LWNWF (SEQ ID NO:56) 

Subtype F Isolate: BZ163A 

WMEWQKEISNYSNEVYRLIEKSQNQQEKNEQGLLAJLDKWASLWNWF 

(SEQ ID NO:57) 

WMEWQKEISNYSNEVYRLB3KSQNQQEKNEQGLL (SEQ ID NO:58) 

YSNEVYRLIEKSQNQQEKNEQGLLALDKW ASLWNWF (SEQ ID NO:59) 
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Subtype G Isolate: FI.HH8793 

WIQWDREISNYTQQIYSLIEESQNQQEKNEQDLLALDNWASLWTWF 

(SEQIDNO:60) 

WIQWDREISNYTQQIYSLEESQNQQEKNEQDLL (SEQ ID NO:61) 

5 YTQQIYSLIEES QNQQEKNEQDLLALDNW ASLWTWF (SEQE)NO:62) 

Subtype H Isolate: BE.VI997 

WMEWDRQmNYTEVIYRLLELSQTQQEQNEQDLLALDKWDSLWNWF 

(SEQIDNO:63) 

WMEWDRQEDNYTEVIYRLLELSQTQQEQNEQDLL (SEQ ID NO:64) 

10 YTEVIYRLLELSQTQQEQNEQDLLALDKWDSLWNWF (SEQIDNO:65) 

Subtype J Isolate: SE.SE92809 



(SEQIDNO:66) 

WIQWEREINNYTGHYSLIEEAQNQQENNEKDLL (SEQ ID NO:67) 

15 YTGIIYSLffiEAQNQQENNEKDIJ^DKWTNLWNWFN (SEQK)NO:68) 

Group N Isolate: CM.YBF30 

WQQWDEKVRNYSGVIFGLIEQAQEQQNTNEKSLLELDQWDSLWSWF 

(SEQIDNO:69) 

WQQWDEKVRNYSGVIFGLIEQAQEQQNTNEKSLL (SEQIDNO:70) 
20 YSGVIFGLJEQAQEQQNTNEKSLLELDQWDSLWSWF (SEQ ID NO:71) 



Group O Isolate: CM.ANT70C 

WQEWDRQISNISSTrreEIQKAQVQQEQNEKKLLELDEWASIWNWL 

(SEQIDNO:72) 

WQEWDRQISNISSTIYEEIQKAQVQQEQNEKKLL (SEQ ID NO:73) 

25 ISSTIYEEIQKAQVQQEQNEKKLLELDEWASIWNWL (SEQIDNO:74) 
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- The peptides and conjugates of the present invention may be acylated at the NH 2 
terminus, and may be amidated at the COOH terminus. 

The peptides and conjugates of the invention may include conservative amino acid 
substitutions. Conserved amino acid substitutions consist of replacing one or more amino 
5 acids of the peptide sequence with amino acids of similar charge, size, and/or 
hydrophobicity characteristics, such as, for example, a glutamic acid (E) to aspartic acid 
(D) amino acid substitution. When only conserved substitutions are made, the resulting 
peptide is functionally equivalent to the peptide from which it is derived. 

Peptide sequences defined herein are represented by one-letter symbols for amino 



10 


acid residues as follows: 








A 


alanine 


L 


leucine 




R 


arginine 


K 


lysine 




N 


asparagine 


M 


methionine 




D 


aspartic acid 


F 


phenylalanine 


15 


C 


cysteine 


P 


proline 




Q 


glutamine 


S 


serine 




E 


glutamic acid 


T 


threonine 




G 


glycine 


W 


tryptophan 




H 


histidine 


Y 


tyrosine 


20 


I 


isoleucine 


V 


valine 



The peptides and conjugates of the invention may include amino acid insertions 
which consist of single amino acid residues or stretches of residues ranging from 2 to 15 
amino acids in length. One or more insertions may be introduced into the peptide, peptide 
fragment, analog and/or homolog. 
25 The peptides and conjugates of the invention may include amino acid deletions of 

the full length peptide, analog, and/or homolog. Such deletions consist of the removal of 
one or more amino acids from the full-length peptide sequence, with the lower limit length 
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of the resulting peptide sequence being 4 to 6 amino acids. Such deletions may involve 
a single contiguous portion or greater than one discrete portion of the peptide sequences. 

The peptides of the invention may be synthesized or prepared by techniques well 
known in the art. See, for example, Creighton, Proteins: Structures and Molecular 

5 Principles, W.H. Freeman & Co., New York, NY (1983), which is incorporated herein 
by reference in its entirety. Short peptides, for example, can be synthesized as a solid 
support or in solution. Longer peptides maybe made using recombinant DN A techniques. 
Here, the nucleotide sequences encoding the peptides of the invention maybe synthesized, 
and/or cloned, and expressed according to techniques well known to those of ordinary 

10 skill in the art. See, for example, Sambrook, et al. y Molecular Cloning, A Laboratory 
Manual, Vols. 1-3, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989). 

In yet another embodiment of the invention, peptides comprising the sequences 
described above may be synthesized with additional chemical groups present at their 
- amino and/or carboxy termini, such that* for example r the stability, bioavailability, and/or 

1 5 immunogenic activity of the peptides is enhanced. For example, hydrophobic groups such 
as carbobenzoxy, dansyl, or t-butyloxycarbonyl groups, may be added to the peptides' 
amino termini. Likewise, an acetyl group or a 9-fluorenylmethoxy-carbqnyl group may 
be placed at the peptides' amino termini. Additionally, the hydrophobic group 
t-butyloxycarbonyl, or an amido group may be added to the peptides' carboxy termini. 

20 In one preferred embodiment, carrier proteins, such as keyhole limpet hemocyanin, 
ovalbumin, BSA or tetanus toxoid are added to the peptide. 

With reference to the peptides P-17 and P-18, deletion mutants are further 
described. 

The peptide P-18 corresponds to amino acid residues 638 to 673 of the 
25 transmembrane protein gp41 from the HIV-1^ isolate: 

In addition to the full-length C-helical peptides identified above, useful peptides 
of the invention may include truncations of the C-helical peptides (SEQ ID NO:4) which 
exhibit the ability to raise neutralizing antibodies or form a six-helix hydrophobic core 
structure under conditions described herein. Such truncated peptides may comprise 
30 peptides of between 3 and 56 amino acid residues, i.e., peptides ranging in size from a 
tripeptide to a 56-mer polypeptide. As an example, such peptides are listed for P-18 in 
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Tables I and II, below. Peptide sequences in these tables are listed from amino (left) to 
carboxy (right) terminus. "X" may represent an amino group (-NH 2 ) and "Z" may 
represent a carboxyl (-COOH) group. Alternatively, as described below, "X" and/or "Z" 
may represent a hydrophobic group, an acetyl group, a FMOC group, an amido group, 
or a covalently attached macromolecule. 



10 



15 



20 



25 



30 



TABLE I 



Carboxy Truncations of SEQ ID NO;5 



X-YTS-Z 

X-YTSL-Z 

X-YTSLI-Z 

X-YTSLIH-Z 

X-YTSLIHS-Z 

X-YTSLIHSL-Z 

X-YTSLIHSLI-Z 

X-YTSLIHSLIE-Z 

X-YTSLIHSLIEE-Z 

X-YTSLIHSLIEES-Z 

X-YTSLIHSLIEESQ-Z 

X-YTSLIHSLIEESQN-Z 

X-YTSLIHSLIEESQNQ-Z 

X-YTSLIHSLIEESQNQQ-Z 

X-YTSLIHSLIEESQNQQE-Z 

X- YTSLIHSLIEESQNQQEK- Z 

X-YTSLIHSLIEESQNQQEKN- Z 

X-YTSLIHSLIEESQNQQEKNE-Z 

X-YTSLIHSLIEESQNQQEKNEQ-Z 

X -YTSLIHSLIEESQNQQEKNEQE - Z 

X-YTSLIHSLIEESQNQQEKNEQEL-Z 

X-YTSLIHSLIEESQNQQEKNEQELL-Z 

X-YTSLIHSLIEESQNQQEKNEQELLE-Z 

X-YTSLIHSLIEESQNQQEKNEQELLEL-Z 

X- YTSLIHSLIEESQNQQEKNEQELLELD- Z 

X- YTSLIHSLIEESQNQQEKNEQELLELDK- Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKW-Z 
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X-YTSLIHSLIEESQNQQEKNEQELLELDKWA-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWAS-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASL-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLW-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWN-Z 

X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNW-Z 

X-YTSL IHSL I EESQNQQEKNEQELLELDKWASLWNWF - Z . 

The one letter amino acid code is used. 
"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, polyethylene 
glycol, or carbohydrates. 

"Z ,! may represent a terminal carboxyl (COOH); an amido group; 
an ester group (COOR) including, but not limited to, a t-butyloxycarbonyl 
group; a macromolecular carrier group including, but not limited to, lipid- 
fatty acid conjugates, polyethylene glycol, or carbohydrates. 
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TABLE II 



Amino Truncations of SEQ ID NO:5 



X-NWF-Z 

X-WNWF-Z 

X-LWNWF-Z 

X-SLWNWF-Z 

X-ASLWNWF-Z 

X-WASLWNWF-Z 

X-KWASLWNWF-Z 

X-DKWASLWNWF-Z 

X - LDKWAS LWNWF - Z 

X - ELDKWASLWNWF - Z 

X - LELDKWASLWNWF - Z 

X - LLELDKWASLWNWF - Z 

X-ELLELDKWASLWNWF-Z 

X -QELLELDKWASLWNWF - Z 

X - EQELLELDKWASLWNWF - Z 

X-NEQELLELDKWASLWNWF-Z 

X-KNEQELLELDKWASLWNWF-Z 

X-EKNEQELLELDKWASLWNWF - Z 

X- QEKNEQELLELDKWASLWNWF - Z 

X- QQEKNEQELLELDKWASLWNWF - Z 

X-NQQEKNEQELLELDKWASLWNWF-Z 

X-QNQQEKNEQELLELDKWASLWNWF- Z 

X- SQNQQEKNEQELLELDKWASLWNWF - Z 

X - ESQNQQEKNEQELLELDKWAS LWNWF - Z 

X-EESQNQQEKNEQELLELDKWASLWNWF-Z 

X - 1 EE SQNQQEKNEQELLELDKWAS LWNWF - Z 

X - L I EE S QNQQEKNEQEL LELDKWAS LWNWF - Z 

X-SLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-HSLIEESQNQQEKNEQELLELDKWASLWNWF- Z 

X-IHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X-LIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 

X- SLIHSLIEESQNQQEKNEQELLELDKWASLWNWF- Z 
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10 



X-TSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 
X-YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-Z 



The one letter amino acid code is used. 

"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, lipid-fatty acid conjugates, polyethylene 
glycol, or carbohydrates. 

"Z n may represent a terminal carboxyl (COOH); an amido group; 
an ester group (COOR) including, but not limited to, a t-butyloxycarbonyl 
group; a macromolecular.carrier group including, but not limited to, lipid- 
fatty acid conjugates, polyethylene glycol, or carbohydrates. 



The peptides also include analogs of which may include, but are not limited to, 
1 5 peptides comprising the a full-length or truncated sequence, containing one or more amino 
acid substitutions, insertions and/or deletions. 

_ . There exists a striking amino acid conservation within the Crbelical regions of 

HIV- 1 and HIV-2. The amino acid conservation is of a periodic nature, suggesting some 
conservation of structure and/or function. A useful peptide derived from the HTV-l^^ 
20 isolate has the 36 amino acid sequence (reading from amino to carboxy terminus): 

NH 2 -LEAMSQSLEQAQIQQEK]NM (SEQ ID NO:5) 

Further, peptides useful for forming "entry-relevant" structures include peptides 
corresponding to the N-helical domain of gp41. One example of such a peptide, P-17, 
corresponds to residues 558 to 595 of the transmembrane protein gp41 from the HTV-1^ 
25 isolate. 

In addition to the full-length N-helical peptides (for example, (SEQ ID NO:l)) 
shown above, the peptides may include truncations of these peptides which exhibit the 
ability to form stable coiled-coil structure. Such truncated peptides may comprise 
peptides of between 3 and 55 amino acid residues, i.e., peptides ranging in size from a 
30 tripeptide to a 55-mer polypeptide, as shown in Tables III and IV, below for P-17. 
Peptide sequences in these tables are listed from amino (left) to carboxy (right) terminus. 
"X" may represent an amino group (-NH 2 ) m ^ "Z" may represent a carboxyl (-COOH) 



BNSDOCID: <WO 00406 16A1J_> 



WO 00/40616 



PGT/USOO/00456 



-35- 



group. Alternatively, "X" and/or "Z" may represent a hydrophobic group, an acetyl 
group, a FMOC group, an amido group or a covalently attached macromolecular group. 
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TABLE III 



Carboxy Truncations of SEQ ID NO:2 



X-NNL-Z 

X-NNLL-Z 

X-NNLLR-Z 

X-NNLLRA-Z 

X -NNLLRAI - Z 

X-NNLLRAIE-Z 

X-NNLLRAIEA-Z 

X -NNLLRAI EAQ - Z 

X-NNLLRAIEAQQ-Z 

X -NNLLRAI EAQQH - Z 

X -NNLLRAI EAQQHL-Z 

X-NNLLRAIEAQQHLL-Z 

X -NNLLRAIEAQQHLLQ - Z 

X -NNLLRAIEAQQHLLQL - Z 

X-NNLLRAIEAQQHLLQLT- Z ' 

X-NNLLRAIEAQQHLLQLTV-Z 

X-NNLLRAIEAQQHLLQLTVW-Z 

X -NNLLRAI EAQQHLLQLTVWQ- Z 

X -NNLLRAI EAQQHLLQLTVWQ I - Z 

X -NNLLRAI EAQQHLLQLTVWQ I K- Z 

X-NNLLRAIEAQQHLLQLTVWQIKQ- Z 

X-NNLLRAIEAQQHLLQLTVWQIKQL- Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQ- Z 

X -NNLLRAI EAQQHLLQLTVWQIKQLQA- Z 

X-NNLLRAIEAQQHLLQLTVWQIKQLQAR- Z 

X -NNLLRAIEAQQHLLQLTVWQIKQLQARI - Z 
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X -NNLLRAI EAQQHLLQ LTVWQ I KQLQARI L - Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILA-Z 
X-NNLLRAIEAQQHLLQLTVWQ I KQLQARI LAV- Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVE- Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVER-Z 
X -NNLLRAI EAQQHLLQLTVWQ I KQLQARILAVERY- Z 
X-NNLLRAIEAQQHLLQLTVWQIKQLQARILAVERYL-Z 
X -NNLLRAIEAQQHLLQLTVWQ I KQLQARI LAVERYLK- Z 
X -NNLLRAI EAQQHLLQLTVWQ IKQLQARILAVERYLKD- Z 
X -NNLLRAI EAQQHLLQLTVWQ I KQLQAR I LAVERYLKDQ - Z 



The one letter amino acid code is used. 

"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited "to, lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido 
group; an ester group (COOR) including, but not limited to, a 
t-butyloxycarbonyl group; a macromolecular carrier group including, but 
not limited to, lipid-fatty acid conjugates, polyethylene glycol, or 
carbohydrates. 
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TABLE IV 



Amino Truncations of SEQ ID NO:2 



X-KDQ-Z 
X-LKDQ-Z 
X- YLKDQ -Z 
X-RYLKDQ-Z 
X - ER YLKDQ -Z 
X - VERYLKDQ - Z 
X - AVER YLKDQ - Z 
X - LAVERYLKDQ - Z 
X - 1 LAVER YLKD Q - Z 
X - R I LAVER YLKDQ - Z 
X-ARILAVERYLKDQ-Z 
X - Q ARI LAVERYLKDQ - Z 
X - LQAR I LAVERYLKDQ - Z 
X - QLQARI LAVERYLKDQ - Z 
X - KQLQ AR I LAVERYLKDQ - Z 
X - IKQLQARI LAVERYLKDQ - Z 
X - Q IKQLQARI LAVERYLKDQ - Z 
X -WQ IKQLQARI LAVERYLKDQ - Z 
X - VWQ IKQLQARI LAVERYLKDQ - Z 
X-TVWQ I KQLQAR I LAVERYLKDQ -Z 
X - LTVWQ I KQLQAR I LAVERYLKDQ - Z 
X-QLTVWQIKQLQARILAVERYLKDQ- Z 
X-LQLTVWQIKQLQARILAVER YLKDQ- Z 
X - LLQLTVWQ IKQLQARI LAVERYLKDQ - Z 
X-HLLQLTVWQIKQLQARI LAVERYLKDQ- Z 
X - QHLLQ LTVWQ I KQLQAR I LAVERYLKDQ - Z 
X - Q QHLLQ LTVWQ I KQLQARI LAVERYLKDQ - Z 
X-AQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z 
X - EAQQHLLQLTVWQIKQLQARI LAVERYLKDQ - Z 
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X - 1 EAQQHLLQLTVWQ IKQLQARI LAVERYLKDQ - Z 
X-AIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z 
X-RAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ- Z 
X- LRAIEAQQHLLQLTVWQIKQLQARI LAVERYLKDQ - Z 
X - LLRAI EAQQHLLQLTVWQ IKQLQAR I LAVERYLKDQ - Z 
X-NLLRAIEAQQHLLQLTVWQIKQLQARILAVERYLKDQ-Z 
X -NNLLRAI EAQQHLLQLTVWQIKQLQARI LAVERYLKDQ - Z 



The one letter amino acid code is used. 

"X" may represent a hydrogen attached to the terminal amino 
group, an amino protecting group including, but not limited to, 
carbobenzoxyl, dansyl, or t-butyloxycarbonyl; an acetyl group; a 
9-fluorenylmethoxy-carbonyl (FMOC) group; a macromolecular carrier 
group including, but not limited to, Lipid-fatty acid conjugates, 
polyethylene glycol, or carbohydrates. 

"Z" may represent a terminal carboxyl (COOH); an amido 
group; an ester group (COOR) including, but not limited to, a 
t-butyloxycarbonyl group; a macromolecular carrier group including, but 
not limited to, lipid-fatty acid conjugates,-polyethylene glycol,- or 
carbohydrates. 



The N-helical peptides also include analogs and/or truncations which may 
include, but are not limited to, peptides comprising the full-length or a truncated 
sequence, containing one or more amino acid substitutions, insertions and/or deletions. 



Antibody Generation and Characterization 



Generation and characterization of the antibodies generated against novel gp41 
25 epitopes constitutes the second aspect of the invention. The experimental sera and 
monoclonal antibodies generated against the target immunogens are subjected to 
thorough biophysical and biological evaluation. 

Antibodies are generated following established protocols. All small animal work 
(immunizations, bleeds, and hybridoma production) is carried out by standard methods 
30 known to those of skill in the art. A first set of immunogens consists of the peptide 
constructs P-15 or P-17 (capable of forming trimeric coiled-coil multimers, optionally 
stabilized by chemical cross-linking or oxidation), P-16 or P-18, and the P-17/P-18 
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mixture or P-15/P-16 mixture (wherein the peptides are optionally chemically or 
oxidatively cross-linked). In one set of experiments, the immunogens are conjugated to 
a carrier such as KLH. 

Balb-c mice are immunized with each of these constructs. Due to possible 
5 disruptive effects of carrier conjugation on antigen structure, one group of mice from 
each set can be immunized with 100 jug of unconjugated peptide, while another group 
of mice can receive 100 jug of antigen conjugated to KLH. Following the initial 
immunization the animals receive a 100 fig boost on day 14 followed by 50 |ig boosts on 
days 30 and 45. Bleeds occur two weeks following the final boost. Mice are also 

10 immunized with the recombinant constructs following the same outline as that for the 
peptide immunogens. 

Alternative immunization approaches include the use of a recombinant 
adenovirus vector expressing all or part of the HIV- 1 envelope glycoprotein gp 120/gp41 
as the primary immunogen followed by booster immunizations with the gp41 peptides, 

15 proteins or other constructs. 

The polyclonal sera generated by the immunization of experimental animals 
undergo an initial screen for virus inhibition. Antiviral activity is evaluated in both 
cell-cell fusion and neutralization assays. In this second format, a representative sample 
of lab adapted and primary virus isolates is used. Both assays are carried out according 

20 to protocols described previously (Wild, C, et al, Proc. Natl Acad. ScL USA 89: 10537- 
10541 (1992); Wild, C, et al, Proc. Natl Acad. ScL USA 91: 12676-12680 (1994); 
Wild, C, et al, Proc. Natl Acad. ScL USA 97:9770-9774 (1994)). Samples are also 
screened by EL1S A to characterize binding. The antigen panel includes all experimented 
immunogens. Animals with sera samples which test positive for binding to one or more 

25 experimental immunogens are candidates for use in MAb production. Following this 
initial screen, one animal representing each experimental immunogen is selected for 
monoclonal antibody production. The criteria for this selection is based on neutralizing 
antibody titers and in the absence of neutralization, binding patterns against the panel of 
structured immunogens. 

30 Hybridoma supernatants are screened by ELISA, against structured and 

non-structured peptides and recombinants. Samples that are ELISA negative or weakly 
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positive are further characterized for IgG. If IgG is present the material is screened in 
the biophysical and biological assays. Strongly positive samples are screened for their 
ability to neutralize virus and bind envelope. The experimental material can be further 
tested against a panel representing the spectrum of HIV-1 isolates. These isolates 
5 include lab adapted and primary virus strains, syncytium and non-syncytium inducing 
isolates, virus representing various geographic subtypes, and viral isolates which make 
use of the range of second receptors during virus entry. These neutralization assays 
employ either primary cell and cell line targets as required. 

Antibodies are characterized in detail for their ability to bind HIV envelope under 
10 various conditions. It is another object of the invention to determine the gp41 target 
epitopes are exposed on native envelope or if the envelope must first undergo some 
interaction which triggers a conformational change i.e binding CD4 and/or co-receptor 
in order to expose these epitopes. For detection of antibody binding to native envelope, 

_ immunoprecipitations on Env-expressing cells and virions^-both intact and lysed are- 

15 performed using non-ionic detergents (Furata, RA et aU Nat Struct. Biol 5(^:276-279 
(1997); White, J. M. and I. A. Wilson, /. Cell Biol 705:2887-2894 (1987); Kemble, G. 
W., et aL J. Virol 66:4940-4950 (1992)). Antibody binding to cell lysates and intact 
virions are also assayed in an ELISA format. Flow cytometry experiments are 
performed to determine binding to envelope expressing cells. Cross-competition 
20 experiments using other mapped Mabs, human sera, and peptides can also be performed. 
To characterize "triggers" to the conformational change, antibody binding to virus in the 
presence and absence of both sCD4 and target cells can be compared (White, J. M. and 
I. A. Wilson, J. Cell Biol 705:2887-2894 (1987); Kemble, G. W., et al, J. Virol 
66:4940-4950 (1992)). Because the gp4 1 regions are highly conserved, epitope exposure 
25 using several different envelopes can be compared to discern possible differences in 
structure between primary, lab-adapted and genetically diverse virus isolates. 

Pharmaceutical Compositions and Methods of Using 

The immunogenic constructs of the present invention can be employed in 
vaccines in an amount effective depending on the route of administration. Although 
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subcutaneous or intramuscular routes of administration are preferred, peptides, multimers 
or peptide conjugates of the present invention can also be administered by an 
intraperitoneal or intravenous route. One skilled in the art will appreciate that the 
amounts to be administered for any particular treatment protocol can be readily 
5 determined without undue experimentation. 

The vaccines of the present invention may be employed in such forms as 
capsules, liquid solutions, suspensions or elixirs for oral administration, or sterile liquid 
forms such as solutions or suspensions. Any inert carrier is preferably used, such as 
saline, phosphate-buffered saline, or any such carrier in which the conjugate vaccine has 

10 . suitable solubility properties. The vaccines may be in the form of single dose 
preparations or in multi-dose flasks which can be used for mass vaccination programs. 
Reference is made to Remington's Pharmaceutical Sciences, Osol, ed., Mack Publishing 
Co., Easton, PA (1980), and New Trends and Developments in Vaccines, Voller, et al. 9 
eds., University Park Press, Baltimore, MD (1978), for methods of preparing and using 

15 vaccines. 

The vaccines of the present invention may further comprise adjuvants which 
enhance production of HIV-specific antibodies. Such adjuvants include, but are not 
limited to, various oil formulations such as Freund's complete adjuvant (CFA), stearyl 
tyrosine (ST, see U.S. Patent No. 4,258,029), the dipeptide known as MDP, saponins and 

20 saponin derivatives, such as Quil A and QS-21, aluminum hydroxide, and lymphatic 
cytokine. Preferably, an adjuvant will aid in maintaining the secondary and quaternary 
structure of the immunogens. 

Freund's adjuvant is an emulsion of mineral oil and water which is mixed with 
the immunogenic substance. Although Freund's adjuvant is powerful, it is usually not 

25 administered to humans. Instead, the adjuvant alum (aluminum hydroxide) or ST may 
be used for administration to a human. The vaccine may be absorbed onto the aluminum 
hydroxide from which it is slowly released after injection. The vaccine may also be 
encapsulated within liposomes according to Fullerton, U.S. Patent No. 4,235,877, or 
mixed with or liposomes or lipid mixtures to provide an environment similar to the cell 

30 surface environment. 
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In another preferred embodiment, one or more immunogens of the invention are 
combined with other immunogens that are used to vaccinate animals. 

In another preferred embodiment, the present invention relates to a method of 
inducing an immune response in an animal comprising administering to the animal the 
5 vaccine of the invention in an amount effective to induce an immune response. 
Optionally, the vaccine of the invention may be coadministered with effective amounts 
of other immunogens as mentioned above to generate multiple immune responses in the 
animal. 

Compositions of the invention are useful as vaccines to induce active immunity 
10 towards antigens in subjects. Any animal that may experience the beneficial effects of 
the compositions of the present invention within the scope of subjects that may be 
treated. The subjects are preferably mammals, and more preferably humans. 

The administration of the vaccine may be for either a "prophylactic"" or "therape- 
„_uti£ , \purpose._.^^ 

15 of any symptoms of HIV infection, or in advance of any known exposure to HIV. The 
prophylactic administration of the vaccine(s) serves to prevent or attenuate any 
subsequent infection. When provided therapeutically, the vaccine(s) is provided upon or 
after the detection of symptoms which indicate that an animal may be infected with HIV, 
or upon or after exposure to the virus. The therapeutic administration of the vaccine(s) 

20 serves to attenuate any actual infection, for example as measured by improving the 
symptoms of a subject, or by reducing the level of viral replication. Thus, the vaccines, 
may be provided either prior to the onset of infection (so as to prevent or attenuate an 
anticipated infection) or after the initiation of an actual infection. 
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Example 1 

Immunogens of the Present Invention Elicit a Neutralizing Antibody Response to 
Entry-Relevant Structures on HIV-1 gp41 

Materials and Methods 

5 Polyclonal sera can be obtained by immunizing rabbits or guinea pigs using 

methods well known to those skilled in the art. For example, the animals are immunized 
at multiple sites (sub-cutaneous and sub-clavicular) with a total of 200 fig (rabbits) or 
100 (ig (guinea pigs) of the appropriate peptide, protein, combination of construct in 
complete Freund's adjuvant. This is followed by two booster immunizations with the 
10 same immunogen in incomplete Freund's adjuvant at monthly intervals following the 
primary immunization. Sera are collected prior to, and at intervals following, the series 
of immunizations. These sera are analyzed for the presence of antibodies to the 
immunogen or other antigens by various assays including those described below. 

Peptide ELISA: Antigen was coated onto 96- well microtiter plates (Immulon 2) at 1 
15 pg/well. Following overnight incubation at 4 °C, plates were washed, blocked and test 
sera was added. After a 1.5 hr incubation, plates are washed and bound antibody is 
detected by addition of phosphatase-conjugated secondary antibody and development by 
pNPP. 

Dot Blots: Antigen (2 jag) was blotted onto nitrocellulose, blocked and allowed to air 
20 dry. Blots were incubated 4 hr with test sera at a 1 : 100 dilution. A secondary antibody 
peroxidase/TMB detection system was used. 

Western Blots: Immunoblots were carried out using commercially available strips 
(Organon Teknika) with test sera at 1:100. 

Viral Lysate Immunoprecipitation: HIV-1 infected cells (ISB/H9) were lysed and 
25 mixed with immune sera at a dilution of 1 : 100. Following incubation with Protein A- 
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agarose, immunoprecipitates were separated by SDS-PAGE and probed with a gp41 
specific monoclonal antibody. 

Cell Surface Immunoprecipitation: Two days post transfection, 1.5 x 10 7 envelope 
expressing 293T cells were incubated with experimental sera with and without sCD4 
5 ( lOpg/ml unless otherwise noted). Following incubation, cells were lysed and incubated 
with Protein A-agarose. Immunoprecipitates were separated by SDS-PAGE and probed 
with a gp41 specific monoclonal antibody. 

Neutralization Assay: Test sera was incubated at a 1 : 10 dilution with indicated amount 
of virus (HIV-1 mB) for lhr at 37 °C. At the end of this time target cells were added 
10 (CEM) and the experiment was returned to the incubator. On days 1, 3 and 5, post- 
infection complete media changes were carried out. On day 7 PI culture supernatant 
were harvested. Levels of virus replication were determined by p24 antigen capture. 
Levels of replication in test wells were normalized to virus only controls. 

Results 

1 5 Rabbits or guinea pigs were immunized and sera analyzed by methods described 

above. The following data describe the characterization of polyclonal antibodies 
generated to various immunogens that are the subject of this invention. 

Table V illustrates results of the analysis of polyclonal sera to various 
immunogens analyzed by peptide ELISA or dot blots. Several immunogens elicited a 

20 strong antibody response in these assays. For example, immunization with P 1 5 resulted 
in sera with strong antibody reactivity to P 15 by peptide ELISA (titer >1: 102400), and 
strong reactivity to P15, a mixture of P15 + P16 and HIV-1 gp41 by dot blot. Similar 
results were obtained in these assays following immunization with a mixture of P15 and 
PI 6 (Table V). 

25 Description of Table V: Analysis of polyclonal sera to various immunogens by 

peptide ELISA or dot blot. For this and subsequent figures all results are based on 
immunizations of rabbits except for immunizations with P- 1 7 or P- 1 8 alone which were 
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performed in guinea pigs. The immunogens used are indicated in the vertical list on the 
left side of the table. The antigens used in each assay are indicated on the top row of the 
table. Peptide ELISA results are presented as titers (the maximum dilution that gives a 
positive result in the assay). Dot blot results are scored from - (no reactivity) to +++ 
(very strong reactivity). HIV TM is HIV-1 gp41. For Table V, *BS 3 refers to chemically 
cross linked; and ND indicates "not determined." 
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These results were confirmed and extended by analysis of the polyclonal sera for 
reactivity with HIV-1 gpl20, gp41 or gpl60 by western blot or immunoprecipitation 
(Table VI). For example, immunization with P15orP15+P16 elicited antibodies that 
reacted with gpl60 by western blot, and which precipitated gp41 in infected cell lysates. 
5 Of particular interest, PI 5 + P16 elicited an immune response that reacted with cell 
surface gp41, but only following treatment of the cells with sCD4 (Figure 4). Previous 
reports have found that sCD4 binds to gpl20 resulting in conformational changes in 
gpl20/gp41 or stripping of gpl20 from gp41. This process presumably mimics events 
that occur at attachment of HIV-1 to its receptor CD4 on target cells. The present results 

10 suggest that immunization with the mixture ofP15+P16 elicits an immune response to 
cryptic epitopes on gp41 that are only exposed following binding of gpl20 to CD4. 
Table VI: Analysis of polyclonal sera to various immunogens by western blot or 
immunoprecipitation. The immunogens used are indicated in the vertical list on the left 
side of the table. The antigens used in each assay are indicated on the top row of the 

1 5 table. Results are scored from - (no reactivity) to ++++ (very strong reactivity), w: weak 
reactivity; * : BS 3 chemically cross-linked prior to administration; ND: not determined; 
HIVTM:HrV-lgp41. 
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Table VI 
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Figure 5 provides data demonstrating that immunogens of the present invention 
elicit a neutralizing antibody response. While some non-specific inhibition of HTV-1 
replication is seen following incubation with pre-bleed sera, considerably greater 
15 inhibition is seen following incubation with sera from animals immunized with P15 or 
P 1 5 + P 1 6. These results indicate that these sera contain neutralizing antibodies resulting 
from immunization with the immunogen of, and by the methods of, the current 
invention. 

These data are supported by the fact that monoclonal antibodies have been 
20 generated in mice to several of the immunogens discussed above. When analyzed by 
some of the methods described above similar results were obtained to those seen with the 
polyclonal sera (not shown). 
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Discussion 

The structural components of gp41, which are present only during virus entry, 
form a novel set of neutralizing epitopes. The relatively short lived nature of these entry 
relevant structures and their presence only during natural infection would account for the 

5 observation that neutralizing antibodies targeting gp41 epitopes are poorly represented 
in sera from HIV infected individuals and all but absent in vaccinee sera. This theory 
is supported by work involving synthetic peptides which model the regions of gp41 
identified as taking part in the entry related structural reorganization (Wild, C, et al, 
Proc. Natl. Acad. Sci. USA 59:10537-10541 (1992); Wild, C, et al t AIDS Res. Hum. 

10 Retroviruses 9:1051-1053 (1993); Wild, C, etal, Proc. Natl Acad. Sci. USA 97:12676- 
12680 (1994); Wild, C, et al t AIDS Res. Hum. Retroviruses 77:323-325 (1995); Wild, 
C, etal, Proc. Natl Acad. ScL USA 9 1:9110-911 A (1994)). It has been shown that these 
materials inhibit HIV infection by blocking virus entry and the mechanism of then- 
activity is their ability to interact with and disrupt gp41 structural components critical 

15 to the entry event. Although transitory, these gp41 entry structures are both accessible 
and appropriately sensitive targets for neutralizing antibody. 

Independent of their neutralizing potential, monoclonal antibodies targeted to 
conserved structures in the TM will prove invaluable as reagents for dissecting the 
structural transitions that occur in Env as part of virus entry. 

20 We have been successful in our initial attempt to generate a structure specific 

antibody against the coiled-coil region of gp41. In this work we used a modified form 
of the P-I7 peptide as immunogen and generated MAbs that recognize the structured 
peptide but not a proline containing P-17 analog which is unstructured. Also, this 
antibody can co-immunoprecipitate the P-18 peptide. 
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Example 2 

Neutralizing Antibody Response to Peptides Modeling 
the C-helical Region ofgp41 

This example measures the humoral response to antigens modeling the C-region 
5 of gp41 . This work used synthetic peptides and a recombinant form of viral protein to 

characterize antibodies raised against the C-helical regions of gp41 of the viral TM. 

These studies employ antibody binding assays to determine the ability of these 

materials to generate an immune response to various forms of envelope (native vs. 

denatured) and virus neutralization assays to characterize the antibody response raised 
10 against these gp41 domains. The complete panel of immunogens has generated data 

which allow new insight into the antigenic nature of gp41. Most encouraging have been 

the results from Guinea Pigs immunized with the peptide, P-18, modeling the C-helix 
entry domain (amino acid residues 643-678 of-gp4 1). -Specifically, two of three anim^ 

receiving this material exhibited a neutralizing antibody response against divergent virus 
15 isolates in a variety of assay formats. Additional studies have confirmed these results. 

See Example 3. 

In study 1, guinea pigs were immunized intramuscularly with 100 \xg of P-18 
formulated in either Freund's complete (prime) or incomplete (boost) adjuvant. Animals 
were immunized on days 0, 21, 34, 48 and 62. Blood was collected on days 44, 58 and 

20 72. In the initial screen, sera at at 1:10 dilution were tested for ability to inhibit virus- 
induced cell killing. In these assays two of the three animals receiving the P-18 peptide 
(guinea pigs gp233 and gp234) were able to block the cytopathic effects of a pair of 
prototype HTV-1 isolates. Against the MN isolate >80% protection was achieved while 
against RF protection was >50% . 

25 In an assay employing the same format (against HIV-1 MN ), we titrated the sera 

from gp233 and gp234. As can be seen in Figure 6a, these animals displayed the 
expected dose-related anti-viral activity. Guinea pigs 233 and 234 gave a 50% reduction 
in virus-induced cell killing at 1:40 and 1:37 dilutions, respectively. 

A neutralization assay was carried out employing a different target cell and 

30 endpoint analysis. In this format, CEM T-cell line was inoculated with 200 TCID 50 of 
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the HIV-1 MW isolate. The reduction in viral replication for gp233 and gp 234 at a serum 
dilution of 1: 10 is shown in Figure 6b. 

Figure 6a shows the titration of bleed 2 for each animal against HIV-1 MN in the 
cell killing assay which uses cell viability as a measure of virus neutralization. MT2 
5 cells are added to a mixture of virus (sufficient to result in greater than 80% cell death 
at 5 days post infection) and sera which had been allowed to incubate for approximately 
1 hr. After 5 days in culture, cell viability is measured by vital dye metabolism. Figure 
6b shows the percent neutralization for each bleed at a 1: 10 dilution against HIV-l,^ in 
an assay format employing CEM targets and p24 endpoint. In this assay, sera are 

10 incubated with 200 TCID 50 of virus for 1 hr prior to the addition of cells. On days 1,3, 
and 5 media are changed. On day 7 culture supernatants are collected and analyzed for 
virus replication by p24 antigen levels. In each assay format, percent neutralization is 
determined by comparison of experimental wells with cell and cell/virus controls. 

The pattern of vims neutralization observed in the previous assays is repeated. 

15 At this serum dilution, bleed # 2 for guinea pigs 233 and 234 gave 80% and 90% virus - 

neutralization, respectively. The same pattern of results was observed against the HTV- 
1 SF2 isolate where under identical assay conditions bleed # 2 from animals 233 and 234 
gave 70% and 50% neutralization. Control animals receiving adjuvant only exhibited 
no neutralizing activity. 

20 These sera neutralize the HIV- 1 isolates MN, RF, and SF2. These results indicate 

a breadth of activity unseen in most other subunit immunogens. By comparison, sera 
generated against V3 peptides are restricted in their activity to a small set of very closely 
related isolates. Due to the nature of the experiment the low antibody titers are not 
unexpected. These animals were immunized with free peptide formulated in Freund's 

25 adjuvant. Neither carrier molecules nor accessory proteins were used to enhance the 
immune response to this molecule. Results from binding assays indicate low but 
appreciable levels of antibody against viral envelope. 

In ELIS A assays using recombinant gp4 1 endpoint titers of 1 : 6400- 1 :44,800 were 
observed for these samples. Linking P-18 to KLH (or other carrier molecules) and/or 

30 administering the conjugate in an adjuvant designed to enhance the immunogenicity of 
subunit antigens is expected to result in a significant increase in neutralizing response. 
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Example 3 

In a second study, 2 out of 3 animals immunized with P-18 neutralized the HTV-1 
MN isolate in the assay using the MT2 cell line. 

animal neut50 titer 

5 BT004 1:21 

BT005 1:14 

Also, one animal receiving P-18 coupled to KLH neutralized the MN isolate in 
the same assay format. 

animal neut50 titer 

10 BT 007 1:15 

Example 4 

The peptide used to generate the immune response in Example 2 includes within 
its sequence the linear epitope for the 2F5 monoclonal antibody. To determine if our 
immune response was against this same region of envelope, or involved a previously 

15 unidentified neutralizing epitope, a series of binding experiments was carried out to 
characterize the reactivity of our polyclonal sera. As can be seen in Table 1, at a dilution 
of 1 : 100 all animals exhibit good ELISA binding to the cognate immunogen (P-18). Sera 
from these animals also have substantial antibody titers against a peptide derived from the 
N-icrminal P-18 sequence, PI (Table VJT). However, when tested at this same dilution 

20 against a pair of C-terminal P-18 analogs, P2 andP3 (Table VII) no ELISA reactivity was 
observed (Table Vffl). This result is significant in that the P3 peptide includes the linear 
binding region (ELDKWAS) for the 2F5 monoclonal antibody. These results demonstrate 
that the neutralizing activity in our sera is not due to binding to the 2F5 epitope. 



BNSDOCID: <WO_0040ei6A1_L> 



WO 00/40616 



PCT/USOO/00456 



-53- 



Table VII 

Set of three overlapping peptides corresponding to the P-18 peptide 

PI YTSLIHSLIEESQNQQEK (SEQ ID NO:77) 

P2 EESQNQQEKNEQELLELD (SEQ ID NO:78) 

5 P3 LELDKWASLWNWF 

(SEQ ID NO:79) 

P- 1 8 YTSLfflSLffiESQNQQEKNEQELLELDKWASLWNWF (SEQ ID NO:5) 

Table VIII 

ELIS A binding at 1 : 100 (OD) 

10 Sample PI P2 P3 £^18 

gp232-2 0.833 0.124 0.003 1.423 

gp232-3 0.858 0.022 0.009 1.067 

gp233-2 1.024 0.019 0.010 1.314 

gp233-3 0.885 0.015 0.015 1.161 

15 gp234-2 0.492 0.015 0.016 1.152 

gp234-3 0.796 0.012 0.009 0.913 

ELIS A binding by guinea pig sera to P-18 and a set of overlapping peptides 
corresponding to P-18. The majority of the antibody binding is to P-18 and the 
N-terminal peptide PI. Very little or no reactivity is observed against P2 and P3 
20 modeling the C-terminal region of P- 1 8 . 



BNSDOCID: <WO 0Q40B16A1 I > 



WO 00/40616 



-54- 



PCT/USOO/00456 



Example 5 

Expression of Recombinant gp41 Construct 

The plasmid for expression of the construct containing the N- and C-helical 
domains of HW-1 gp41 separated by a short linker sequence (See FIG. 7) was prepared 

5 as follows. The bacterial expression vector pTCLE-ssG2C, (based on pAED-4, a T7 
expression vector developed specifically for the expression of small proteins) provided 
by Dr. Terrance Oas, Duke University was digested at the unique restriction sites Ndel 
and Ecorl and gel purified using the Qiaex system. The DNA fragments encoding the N- 
and C-helical regions of HIV-1 gp41 and a short linker sequence were PCR amplified 

10 by standard techniques from gp41 expression vectors using the following primers. 
N-helix primer pair: 

5'; 5' GGG CCC ATA TGG GTA TTG TTC AGC AG 3' (includes Ndel site), 

(SEQIDNO:80) 

3'; 5' GGG CCG GCG CCT GAG CCG CCG CCT TGA TCC TTC AGG TAG CGT TC 
15 y (includes Narl site). (SEQIDNO:81) 
C-helix primer pair: 

5'; 5* GGG CCG GCG CCG GCT CAG AGT GGG ACA GAG AAA TTA ACA ATT 
AC 3' (includes Narl site), (SEQ ID NO:82) 

3'; 5' GGG CCG AAT TCT TAA AAC CAA TTC CAC AAA CTT GCC CAT TT 3' 

20 (includes Ecorl site and a stop codon). (SEQIDNO:83) 
These fragments were inserted (blunt end ligation) into the TA vector which was 
amplified to generate larger amounts of DNA. The fragments coding for to the N and C- 
helices were released from the TA vector by restriction digest (C-helix: Narl and EcoRI, 
N-helix: Ndel and Narl) and gel purified. A three-way ligation was performed using 

25 standard procedures to introduce the DNA coding for the N- and C-helical fragments into 
the pTCLE-ssG2C vector. The product of this step was characterized by restriction 
digestion and DNA sequencing. The vector containing the desired gp4 1 coding region was 
prepared in large quantity and BL-21 E. coli host cells were transformed and induced to 
express the desired protein. The desired proteins may or may not have a methionine as 
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the first amino acid at he N-terminus. Over-expression of a protein of the appropriate 
molecular weight was observed by SDS-Page gel electrophoresis. 

Although the foregoing refers to particular preferred embodiments, it will be 
understood that the present invention is not so limited. It will occur to those of ordinary 
5 skill in the art that various modifications may be made to the disclosed embodiments and 
that such modifications are intended to be within the scope of the present invention, which 
is defined by the following claims. 

All publications, patents and patent applications mentioned in this specification are 
indicative of the level of skill of those in the art to which the invention pertains. All 
10 publications, patents and patent applications are herein incorporated by reference to the 
same extent as if each individual publication or patent application was specifically and 
individually indicated to be incorporated by reference in their entirety. 
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What Is Claimed Is: 

1. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a peptide or polypeptide comprising an amino acid that 
5 is capable of forming a stable coiled-coil solution structure corresponding to or mimicking 
the heptad repeat region of gp4 1 , or a fragment thereof. 

2. The method of claim 1 , wherein a peptide is administered, and wherein said 
peptide comprises about 28 to 55 amino acids of the following sequence: 
ARQIXSGIVQQQNN1XRAIEAQQHLXQLTVWGK 

10 (SEQ. ID NO: 1), or multimers thereof. 

3. The method of claim 2, wherein the peptide isjcpnjugated to a earner 

protein. 

4. The method of claim 3, wherein said carrier protein is keyhole limpet 
hemocyanin (KLH), ovalbumin, bovine serum albumin (BSA) or tetanus toxoid. 



15 5. The method of claim 1, wherein said peptide is one of SEQ ID NO: 1, 

SEQ ID NO: 2, SEQ ED NO: 3, or one of SEQ ID NO: 9 through SEQ ID NO: 40; 
and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

6. The method of claim 1 , wherein said peptide has the formula, from amino 

20 . terminus to carboxy terminus, of: 

^ffl r NNLLRAEAQQHLLQLTVWGIKQLQARILAVERYLKDQ-COOH 

(SEQ ID NO: 1); 

or: 

NH r SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL-COOH 
25 (SEQIDNO:2); 
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and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

7. The method of claim 1, wherein said peptide includes one to 10 
conservative substitutions. 

5 8. A method of raising a broadly neutralizing antibody response to HIV, 

comprising: 

administering to a mammal a peptide or polypeptide comprising an amino acid 
sequence that corresponds to, or mimics, the transmembrane-proximal amphipathic a- 
helical segment of gp4 1 (at the C-helical domain of gp4 1 ), or a fragment thereof, wherein 
10 said mammal raises antibodies to a helical solution structure of said peptide or 
polypeptide. 

9 . The method of claim 8, wherein a peptide is administered, and wherein said 
peptide comprises about 24-56 amino acids of the following sequence: 
WNNMTWMEWDREINNYTSLffi 

1 5 N1TNW (SEQ ID NO:4), or a multimer thereof. 

10. The method of claim 8, wherein the peptide or polypeptide is conjugated 
to a carrier protein. 

11. The method of claim 10, wherein said carrier protein is keyhole limpet 
hemocyanin (KLH), ovalbumin, bovine serum albumin (BSA) or tetanus toxoid. 

20 12. The method of claim 8, wherein said peptide includes one to 10 

conservative substitutions. 

13. The method of claim 8, wherein said peptide one of SEQ ID NO: 4, SEQ 
ID NO: 5, SEQ ID NO: 6, or one of SEQ ID NO: 41 through SEQ ID NO: 74; 
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and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

14. The method of claim 8, wherein said peptide has the formula, from amino 

terminus to carboxy terminus, of: 
5 NH 2 -YTSLIHSLIEESQNQQEKNEQELLELDKWASLWNWF-COOH 

(SEQE>NO:3); 

or: 

>m 2 -WMEWDREI>WYTSLIHSLffiESQNQQEKNEQELL-COOH 

(SEQ ID NO:4) 

10 and wherein the peptide can be optionally coupled to a larger carrier protein, or optionally 
include a terminal protecting group at the N- and/or C- termini. 

15. A method of raising a broadly neutralizing antibody response to^ HIV, 
comprising: 

administering to a mammal a composition including one or more peptides or 
1 5 polypeptides which comprise amino acid sequences that are capable of forming solution 
stable structures that correspond to, or mimic, the gp41 core six helix bundle. 

16. The method of claim 15, wherein said one or more peptides or 
polypeptides comprise a mixture of C-helical peptide or polypeptide and N-helical peptide 
or polypeptide. 

20 17. The method of claim 15, wherein at least one of said peptides or 

polypeptides is multimeric, or is a conjugate structure comprised of an N-helical domain 
amino acid sequence and a C-helical domain amino acid sequence. 

18. The method of claim 15, wherein said mixture of C-helical peptide or 
polypeptide and N-helical peptide or polypeptide forms a stable core helix solution 
25 structure. 
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19. The method of claim 15, wherein said mixture comprises: 
P-17andP-18, 

P-15andP-16, 
P-17andP-16or 
5 P-15andP-18. 

20. A method of raising a broadly neutralizing antibody response to HIV, 
comprising: 

administering to a mammal a composition including one or more conjugate 
peptides or polypeptides formed from two or more amino acid sequences that comprise: 
10 (a) one or more amino acid sequences that are capable of forming a stable coiled- 

coil solution structure corresponding to or mimicking the heptad repeat region of 
gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
sequence of the transmembrane-proximal amphipathic a-helical segment of gp4 1 
15 (C-helical domain) ; 

wherein 

said one or more sequences (a) and (b) are alternately linked to one another via 
a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
consisting of about 2 to about 25 amino acids. 

20 21. The method of claim 20, wherein said conjugates are recombinantly 

produced. 

22. The method of claim 2 1 , wherein one or more of said conjugates folds and 
assembles in solution into a structure corresponding to, or mimicking, the gp41 core six 
helix bundle. 

25 23. The method of claim 20, wherein: 

said N-helical peptide comprises about 28 to 55 amino acids of the following 
sequence: 
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ARQIXSGIVQQQNNIXRAIEAQQHIJLX^LTVW 
(SEQ. ID NO: 1 ), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 

sequence: 
5 WNNMTWMEWDREINN^ 

NUNW (SEQ ID NO:4), or multimers thereof. 

24. The method of claim 14 or claim 20, wherein: 

said N-helical peptide is one of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ID NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 
10 optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini; and 

said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 

or one ofSEQ. ID NO: 41 Jhrougb i SEQ ID NO: 74, and wherein the peptide can be 

optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
1 5 group at the N- and/or C- termini. 

25. A conjugate peptide or polypeptide formed from two or more amino acid 

sequences that comprise: 

(a) one or more amino acid sequences that are capable of forming a stable coiled- 
coil solution structure corresponding to or mimicking the heptad repeat region of 

20 gp41 (N-helical domain); and 

(b) one or more amino acid sequences that correspond to, or mimic, an amino acid 
sequence of the transmembrane-proximal amphipathic a-helical segment of gp41 
(C-helical domain); 

wherein 

25 said one or more sequences (a) and (b) are alternately linked to one another via 

a bond, such as a peptide bond (amide linkage) or by an amino acid linking sequence 
consisting of about 2 to about 25 amino acids. 
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26. The conjugate of claim 25, wherein: 

said N-helical peptide comprises about 28 to 55 amino acids of the following 
sequence: 

ARQIJ^GIVQQQNNIJLRAIEAQQH^ 
5 (SEQ. ID NO: 1), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 
sequence: 

WNNMTWMEWDREINNYTSLIHSLIEESQNQQEKNEQELLE 
NITNW (SEQ TD NO:4), or multimers thereof. 

10 27. The conjugate of claim 25, wherein: 

said N-helical peptide is one of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ID NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini; and 

1 5 said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO:' 6, 

or one of SEQ ID NO: 41 through SEQ ID NO: 74, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini. 

28 . A pharmaceutical composition comprising a conjugate of claim 25, and a 
20 pharmaceutical acceptable carrier. 

29. A composition comprising polyclonal or monoclonal antibodies that are 
raised to the conjugate of claim 25. 

30. A composition comprising a mixture of C-helical peptide or polypeptide 
and N-helical peptide or polypeptide, wherein said mixture forms a stable core helix 

25 solution structure. 
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31. The composition of claim 30, wherein: 

said N-helical peptide comprises about 28 to 55 amino acids of the following 
sequence: 

ARQU^GIVQQQNNIXRAIEAQQHLLQLTVW 
5 (SEQ. ID NO: 1), or multimers thereof; and 

said C-helical peptide comprises about 24-56 amino acids of the following 

sequence: 

WNNMTWMEWDREIN^ 

NITNW (SEQ ID NO:4), or multimers thereof. 

10 32. The composition of claim 30, wherein: 

said N-helical peptide is one of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, 
or one of SEQ ID NO: 9 through SEQ ID NO: 40, and wherein the peptide can be 

_ optionally coupled to_a larger carrier protein, or option 

group at the N- and/or C- termini; and 
15 said C-helical peptide is one of SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, 

or one of SEQ ID NO: 41 through SEQ ID NO: 74, and wherein the peptide can be 
optionally coupled to a larger carrier protein, or optionally include a terminal protecting 
group at the N- and/or C- termini. 

33 . A composition comprising polyclonal or monoclonal antibodies that are 
20 raised to the composition of claim 30. 

34. A method of treatment, comprising: 

administering to an individual a composition comprising polyclonal or monoclonal 
antibodies as claimed in claim 29 or claim 33 in an amount effective to reduce HIV 
25 infection of uninfected cells. 

35. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence encoding a peptide or 
polypeptide conjugate of claim 25. 
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36. The nucleic acid molecule of claim 35, wherein said polynucleotide has the 
nucleotide sequence in FIG. 7. 

37. A method for making a recombinant vector comprising inserting an 
isolated nucleic acid molecule of claim 35 into a vector. 

38. A recombinant vector produced by the method of claim 37. 

39. A method of making a recombinant host cell comprising introducing the 
recombinant vector of claim 38 into a host cell. 

40. A recombinant host cell produced by the method of claim 39. 

41. A recombinant method for producing a conjugate peptide or polypeptide, 
comprising culturing the recombinant host cell of claim 40 under conditions such that said 
polypeptide is expressed and recovering said polypeptide. 

42. The method of claim 1, claim 8, claim 15 or claim 20, wherein said 
administering is provided in advance of any symptoms of HIV infection, or in advance of 
any known exposure to HIV. 

43. The method of claim 1, claim 8, claim 15 or claim 20, wherein said 
administering is provided upon or after the detection of symptoms which indicate that an 
animal may be infected with HIV, or upon or after exposure to the virus. 
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SEQUENCE LISTING 



<110> Wild, Carl T . 

Weiss, Carol D. 

<120> Methods of Eliciting Broadly Neutralizing Antibodies 
Targeting HIV-1 gp41 

<130> 1589.016PC01 

<140> 
<141> 

<150> 60/115,404 
<151> 1999-01-08 

<150> to be assigned 
<151> 2000-01-07 

<160> 84 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 55 
<212> PRT . 

<213> Human immunodeficiency virus type 1 
<400> 1 

Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu 
15 10 15 

Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly 
20 25 30 

He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys 
35 40 45 

Asp Gin Gin Leu Leu Gly He 
50 55 



<210> 2 
<211> 38 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 2 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 



<210> 3 
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<211> 36 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 3 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg lie Leu 
35 



<210> 4 
<211> 56. 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 4 

Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr 
1 5 10 15 

• Thr Ser- Leu lie His Ser Leu lie Glu Glu Ser "Gin "Ash Gin Glri Glu 
20 25 30 

Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys, Trp Ala Ser Leu Trp 
35 40 45 

Asn Trp Phe Asn lie Thr Asn Trp 
50 55 



<210> 5 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 5 

Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 

35 : 



<210> 6 
<211> 34 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 6 

Trp Met Glu Trp Asp Arg Glu He Asn Asn Tyr Thr Ser Leu He His 
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10 



15 



Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu 



<210> 7 
<211> 5 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 

<400> 7 
ggqgs 



<210> 8 
<211> 345 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<400> 8 

Ala Val Gly lie Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly 
15 10 15 

Ser Thr Met Gly Ala Arg Ser Met Thr Leu Thr Val Gin Ala Arg Gin 
20 25 30 

Leu Leu Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie 
35 40 45 

Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp. Gly lie Lys Gin 
50 55 60 

Leu Gin Ala Arg lie Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin 
65 70 75 80 

Leu Leu Glv lie Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ala 
85 90 95 

Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gin lie Trp 
100 105 110 

Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr 
] 3 E 120 125 

Ser Let: ills Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys 

13C 135 140 

Asn Glu Glr. Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn 
145 150 155 160 

Trp Phe Asn He Thr Asn Trp Leu Trp Tyr lie Lys lie Phe lie Met 
165 170 175 

lie Val Glv Gly Leu Val Gly Leu Arg .lie Val Phe Ala .Val Leu Ser 
180 185 190 
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Ile Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr 
195 200 205 

His Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu Gly He Glu Glu 
210 215 220 

Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg Leu # Val Asn Gly 
225 230 235 240 

Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser 
245 250 255 

Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr Arg He Val Glu 
260 265 270 

Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu 
275 280 285 

Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu 
290 295 300 

Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp Arg Val He Glu 
305 310 315 320 

Val Val Gin Gly Ala Cys Arg Ala He Arg His He Pro Arg Arg He 
325 330 335 

Arg -Gin -Gly Leu Glu Arg lie Leu Leu - 

340 345 



<210> 9 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 9 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 .5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 10 
<211> 45 
<212> PRT 

<213> Human, immunodeficiency virus 
<400> 10 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 
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<210> 11 
<211> 36 
<212> PRT 



<213> Human 



inununodef iciency virus 



<400> 11 
Ser Gly lie 
1 



Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
5 10 15 



Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 



Ala Arg Val Leu 
35 



<210> 12 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 



<400> 12 

Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
• ! 5 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 13 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 13 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin Arg Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gly Asp Gin 
35 40 45 



<210> 14 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 14 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin Arg Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 
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<210> 15 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 

<400> 15 _ T 

Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin Arg Met Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Gly Asp Gin 
35 



<210> 16 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 16 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 17 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 17 

Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 
1 5 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> lb 

<211> 38 

<212> PRT 

<213> Humcir. irrimunodef iciency virus 

<400> 18 

Asn Asn Uu :.ou Arg Ala He Glu Ala Gin Gin His Met Leu Gin Leu 
1 5 10 15 

Thr Val T rL- Civ He Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 
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<210> 19 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 19 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala lie Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 20 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 20 

-Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 21 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 21 

Ser Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Met Leu Gin Leu 
1 5 10 15 

Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val Leu Ala lie Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 22 
<211> 45 
<212> PRT . 

<213> Human immunodeficiency virus 
<400> 22 

Ser Gly lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Ser Tyr Leu Lys Asp Gin 
35 40 45 
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<210> 23 
<21i> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<4O0> 23 

Asn Asn Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Ser Tyr Leu Lys Asp Gin 
35 



<210> 24 
*<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 24 

Ser Glv He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 



Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gin Asp Gin 
35 40 45 



<210> 25 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 25 

Ser Gly He Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 26 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 26 

Ser Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Gin Asp Gin 
35 
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<210> 27 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 21 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 28 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 28 

Ser Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val Leu Ala Leu Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 29 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Gin Ala 
1 5 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly Val Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 30 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 30 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Arg Ala lie Gin Ala 
15 10 15 

Gin Gin His Met Leu Gin Leu Thr Val Trp Gly Val Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 
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<210> 31 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 31 

Ser Asn Leu Leu Arg Ala lie Gin Ala Gin Gin His Met Leu Gin Leu 
15 10 15 

Thr Val Trp Gly Val Lys Gin Leu Gin Ala Arq Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 



<210> 32 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 32 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Lys Ala lie Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 



Ala Arg^ Val Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin 
35 40 45 



<210> 33 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 33 

Ser Gly lie Val Gin Gin Gin Ser Asn Leu Leu Lys Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Lys Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg Val Leu 
35 



<210> 34 
<211> 38 
<212> PRT 

<213> Human' immunodeficiency virus 
<400> 34 

Ser Asn Leu Leu Lys Ala lie Glu Ala Gin Gin His Leu Leu Lys Leu 
15 10 15 

Thr Val Trp Gly He Lys Gin Leu Gin Ala Arg Val Leu Ala Val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 
35 
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<210> 35 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 35 

Ser Giy lie Val Gin Gin Gin Asn lie Leu Leu Arg Ala lie Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Ser He Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Lys Val Leu Ala He Glu Arg Tyr Leu Arg Asp Gin 
35 40 45 



<210> 36 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 36 

Ser Gly He Val Gin Gin Gin Asn He Leu Leu Arg Ala He Glu Ala 
15 10 15 

Gin Gin His Leu Leu Gin Leu Ser He Trp Gly He Lys Gin Leu Gin 
20 25 30 

Ala Lys Val Leu 
35 



<210> 37 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 37 

Asn He Leu Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu 
15 10 15 

Ser lie Trp Gly He Lys Gin Leu Gin Ala Lys Val Leu Ala lie Glu 
20 25 30 

Arg Tyr Leu Arg Asp Gin 
35 



<210> 38 
<211> 45 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 38 

Lys Gly He Val Gin Gin Gin Asp Asn Leu Leu Arg Ala lie Gin Ala 
15 10 15 

Gin Gin Gin Leu Leu Arg Leu Ser Xaa Trp Gly lie Arg Gin Leu Arg 
20 25 30 

Ala Arg Leu Leu Ala Leu Glu Thr Leu Leu Gin Asn Gin 
35 40 45 



BNSDOCID: <WO 004061 6A1J_> 



WO 00/40616 



PCT/USOO/00456 



-12- 

<210> 39 
<211> 35 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 39 

Lys Gly lie Val Gin Gin Gin Asp Asn Leu Leu Arg Ala lie Gin Ala 
1 5 10 15 

Gin Gin Gin Leu Leu Arg Leu Ser Xaa Trp Gly lie Arg Gin Leu Arg 
20 25 30 

Ala Arg Leu 
35 



<210> 40 
<211> 38 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 40 

Asp Asn Leu Leu Arg Ala lie Gin Ala Gin Gin Gin Leu Leu Arg Leu 
15 10 15 

Ser Xaa Trp Gly lie Arg Gin Leu Arg Ala Arg Leu Leu Ala Leu Glu 
20 25 30 



Thr Leu Leu Gin Asn Gin 
35 



<210> 41 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 41 

Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His 
1-5 10 15 

Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 42 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 42 

Trp Met Glu Trp Glu Arg Glu lie Glu Asn Tyr Thr Gly Leu lie Tyr 
15 10 15 

Thr Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 
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<210> 43 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 43 

Trp Met Glu Trp Glu Arg Glu lie Glu Asn Tyr Thr Gly Leu lie Tyr 
15 10 15 

Thr Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> 44 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 44 

Tyr Thr Gly Leu lie Tyr Thr Leu lie Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 

35 



<210> 45 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 45 

Trp Met Glu Trp Glu Arg Glu lie Asp Asn Tyr Thr Ser Glu lie Tyr 
15 10 15 

Thr Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 46 
<211> 34 
<212> PRT . 

<213> Human immunodeficiency virus 
<400> 46 

Trp Met Glu Trp Glu Arg Glu lie Asp Asn Tyr Thr Ser Glu lie Tyr 
15 10 15 

Thr Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 
20 25 30 

Leu Leu 
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<210> 47 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 47 

Tyr Thr Ser Glu lie Tyr Thr Leu He Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 

Trp Asn Trp Phe 
35 



<210> 48 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 48 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Asp Tyr He Tyr 
1'5 10 15 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 49 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 49 

Trp Met Glu Trp Glu Arg Glu He Asp Asn Tyr Thr Asp Tyr He Tyr 
15 10 15 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
20 25 30 

Leu Leu 



<210> 50 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 50 

Tyr Thr Asp Tyr He Tyr Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin 
15 10 15 

Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 : 25 30 

Trp Asn Trp Phe 
35 
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<210> 51 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 51 

Trp lie Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gly He He Tyr 
15 10 15 

Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Gin Asn Leu Trp Ser Trp Phe 
35 40 45 



<210> 52 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 52 

Trp He Gin Trp Asp Arg Glu lie Ser Asn Tyr Thr Gly He He Tyr 
15 10 15 

Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 53 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 53 

Tyr Thr Gly He lie Tyr Arg Leu Leu Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Asn Asn Glu Lys Asp Leu Leu Ala Leu Asp Lys Trp Gin Asn Leu 
20 25 30 

Trp Ser Trp Phe 
35 



<210> 54 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 54 

Trp Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Gly Leu He Tyr 
1 5 10 15 

Asp Leu lie Glu Glu Ser Gin He Gin Gin Glu Lys Asn Glu Lys Asp 
20 25 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 
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<210> 55 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 55 

Trp Met Glu Trp Glu Arg Glu He Ser Asn Tyr Thr Gly Leu He Tyr 
15 10 15 

Asp Leu He Glu Glu Ser Gin He Gin Gin Glu Lys Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 56 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 56 

Tyr Thr Gly Leu He Tyr Asp Leu lie Glu Glu Ser Gin He Gin Gin 
15 10 15 

Glu Lys Asn Glu Lys Asp Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 
20 25 30 



Trp Asn Trp Phe 
35 



<210> 57 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 57 

Trp Met Glu Trp Gin Lys Glu lie Ser Asn Tyr Ser Asn Glu Val Tyr 
15 10 15 

Arg Leu lie Glu Lys Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Gly 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 58 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 58 

Trp Met Glu Trp Gin Lys Glu lie Ser Asn. Tyr Ser Asn Glu Val Tyr 
1 5 10 15 

Arg Leu He Glu Lys Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Gly 
20 25 30 

Leu Leu 
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<210> 59 

<211> 36 

<212> PRT 

<213> Human immunodeficiency virus 



<400> 59 

Tvr Ser Asn Glu Val Tyr Arg Leu He Glu Lys Ser Gin Asn Gin Gin 
1 5 10 15 

Glu Lvs Asn Glu Gin Gly Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu 
20 25 30 



Trp Asn Trp Phe 
35 



<210> 60 
<211> 46 
<212>' PRT 

<213> Human immunodeficiency virus 
<400> 60 

Trp He Gin Trp Asp Arg Glu He Ser Asn Tyr Thr Gin Gin He Tyr 
1 ■ 5 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu Ala Leu Asp Asn Trp Ala Ser Leu Trp Thr Trp Phe 
35 40 45 



<210> 61 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 61 

Trp He Gin Trp Asp Arg Glu lie Ser Asn Tyr Thr Gin Gin He Tyr 
15 10 15 

Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> e; 

<211> 3t- 

<212> PP.1 

<213> Hun.di. j mmunode f i ciency virus 

<4 00> c: 

Tyr Th: Gin Gin He Tyr Ser Leu lie Glu Glu Ser Gin Asn Gin Gin 
15 10 15 

Glu Lys Asn Glu Gin Asp Leu Leu Ala Leu Asp Asn Trp Ala Ser Leu 
20 25 30 

Trp Thr Trp Phe 

35 
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<210> 63 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 




Ara Leu Leu Glu Leu Ser Gin Thr Gin Gin Glu Gin Asn Glu Gin Asp 
y 20 25 .30 



Leu Leu Ala Leu Asp Lys Trp Asp Ser Leu Trp Asn Trp Phe 
35 40 45 



<210> 64 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 

Tro°Met 4 Glu Trp Asp Arg Gin lie Asp Asn Tyr Thr Glu Val He Tyr 
1 5 10 15 

Ara Leu Leu Glu Leu Ser Gin Thr Gin Gin Glu Gin Asn Glu Gin Asp 
20 25 30 

Leu Leu 



<210> 65 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 



<400> 65 mi _ _ 

Tvr Thr Glu Val He Tyr Arg Leu Leu Glu Leu Ser Gin Thr Gin Gin 
\ 5 10 15 

Glu Gin Asn Glu Gin Asp Leu Leu Ala Leu Asp Lys Trp Asp Ser Leu 
20 ' 25 30 



Trp Asn Trp Phe 
35 



<210> 66 
<211> 47 
<212> PRT • 

<213> Human immunodeficiency virus 

<400> 66 „ ' _ m 

Trp He Gin Trp Glu Arg Glu He Asn Asn Tyr Thr Gly He He Tyr 
1 5 10 * 15 

Ser Leu He Glu Glu Ala Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu Ala Leu Asp Lys Trp Thr Asn Leu Trp Asn Trp Phe Asn 
35 40 45 
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<210> 67 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 67 

Trp lie Gin Trp Glu Arg Glu lie Asn Asn Tyr Thr Gly lie lie Tyr 
15 10 15 

Ser Leu lie Glu Glu Ala Gin Asn Gin Gin Glu Asn Asn Glu Lys Asp 
20 25 30 

Leu Leu 



<210> 68 
<211> 37 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 68 

Tyr Thr Gly lie lie Tyr Ser Leu lie Glu Glu Ala Gin Asn Gin Gin 
15 10 15 

Glu Asn Asn Glu Lys Asp Leu Leu Ala Leu Asp Lys Trp Thr Asn Leu 
20 25 30 

Trp Asn Trp Phe Asn 
35 



<210> 69 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 69 

Trp Gin Gin Trp Asp Glu Lys Val Arg Asn Tyr Ser Gly Val lie Phe 
15 10 15 

Gly Leu lie Glu Gin Ala Gin Glu Gin Gin Asn Thr Asn Glu Lys Ser 
20 25 30 

Leu Leu Glu Leu Asp Gin Trp Asp Ser Leu Trp Ser Trp Phe 
35 40 45 



<210> 70 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 70 

Trp Gin Gin Trp Asp Glu Lys Val Arg Asn Tyr Ser Gly Val lie Phe 
15 10 15 

Gly Leu lie Glu Gin Ala Gin Glu Gin Gin Asn Thr Asn Glu Lys Ser 
20 25 30 

Leu Leu 
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<210> 71 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 71 

Tvr Ser Glv Val He Phe Gly Leu He Glu Gin Ala Gin Glu Gin Gin 
15 10 15 

Asn Thr Asn Glu Lys Ser Leu Leu Glu Leu Asp Gin Trp Asp Ser Leu 
20 25 30 

Trp Ser Trp Phe 
35 



<210> 72 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 72 

Trp Gin Glu Trp Asp Arg Gin He Ser Asn He Ser Ser Thr He Tyr 
15 10 15 

Glu Glu lie Gin Lys Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys 
20 25 30 



Leu Leu Glu Leu Asp Glu Trp Ala Ser He Trp Asn Trp Leu 
35 40 45 



<210> 73 
<211> 34 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 73 

Trp Gin Glu Trp Asp Arg Gin lie Ser Asn He Ser Ser Thr He Tyr 
1 ' 5 10 15 

Glu Glu lie Gin Lys Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys 
20 25 30 

Leu Leu 



<210> 74 
<211> 36 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 74 

lie Ser Ser Thr He Tyr Glu Glu He Gin Lys Ala Gin Val Gin Gin 
15 10 15 

Glu Gin Asn Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala Ser He 
20 25 30 

Trp Asn Trp Leu 
35 
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<210> 75 
<211> 465 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (34 ) . . (357) 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 75 

agcggtgcgc cgaaagtacg cgctaagctt cat atg ggt att gtt cag cag cag 54 

Met Gly lie Val Gin Gin Gin 
1 5 

aac aat ttg ctg agg get att gag gcg caa cag cac ctg ctg cag ctg 102 
Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu 
10 15 20 

acc gta tgg ggc ate aag cag ctg cag gca cgc ate ctg get gtt gaa 150 
Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg lie Leu Ala Val Glu 
25 30 35 

cgc tac ctg aag gat caa ggc ggc ggc tea. ggc gec ggc tea gag tgg 198 
Arg Tyr Leu Lys Asp Gin Gly Gly Gly Ser Gly Ala Gly Ser Glu Trp 
40 45 50 55 

gac aga gaa att aac aat tac aca age tta ata cac tec tta att gaa 246 
Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His Ser Leu He Glu 
60 65 70 

gaa teg caa aac cag caa gaa aag aat gaa caa gaa tta ttg gaa tta 294 
Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu 
75 80 85 

gat aaa tgg gca agt ttg tgg aat tgg ttt gaa ttc ate gat gat ate 342 
Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Glu Phe He Asp Asp He 
90 95 100 

aga tec ggc tgc taa caaagcccga aaggaagctg agtttggctg ctgccacccg 397 
Arg Ser Gly Cys 
105 

ctgagcaata actagcataa ccccttgggg gcctctaaac gggtcttgag gggttttttg 457 
cttgaaag 465 

<210> 76 
<211> 107 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Synthetic 
<400> 76 

Met Gly He Val' Gin Gin Gin Asn Asn Leu Leu Arg Ala He Glu Ala 

15 10 15 

Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 

20 25 30 

Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gly Gly Gly 
35 40 45 
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Ser 


Gly Ala 
50 


Gly 


Ser 


Glu 


Trp 
55 


Asp Arg 


Glu lie 


Asn 
60 


Asn 


Tyr 


Thr 


Ser 


Leu 


He His 


Ser 


Leu 


He 


Glu 


Glu Ser 


Gin Asn 


Gin 


Gin 


Glu 


Lys 


Asn 


65 








70 






75 










80 


Glu 


Gin Glu 


Leu 


Leu 
85 


Glu 


Leu 


Asp Lys 


Trp Ala 
90 


Ser 


Leu 


Trp 


Asn 
95 


Trp 


Phe 


Glu Phe 


He 
100 


Asp 


Asp 


He 


Arg Ser 
105 


Gly Cys 













<210> 77 
<211> 197 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 77 

gagggactat atccggttat tcacaaggac ggctgtgggc gccatgatcg cgtagtcgat 60 
agtggctcca agtaaccgga agcgacaggg actgtgccgg gcgccaaagg cggtcgacag 120 
tgctttctag aaccgggtgc gcataaaaat gcatcacgcc tatagcgcta gagccgctgc 180 
attaaatgaa tcggcca 197 



<210> 78 

<2H> 18 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 78 

Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin 
1 5 10 15 

Glu Lys 



<210> 79 
<211> 18 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 79 

Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu 
15 10 15 

Leu Asp 



<210> 80 
<211> 13 
<212> PRT 
<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 80 

Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe 
15 10 



<210> 81 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 81 

gggcccatat gggtattgtt cagcag 26 



<210> 82 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 82 

gggccggcgc ctgagccgcc gccttgatcc ttcaggtagc gttc 44 



<210> 83 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 83 

gggccggcgc cggctcagag tgggacagag aaattaacaa ttac 44 

<210> 84 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 84 

gggccgaatt cttaaaacca attccacaaa cttgcccatt t 41 
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SFQUGNCE LIFTING 



<11D> Wild, Csrl T. 

Weies, Csrol D. 

<12D> M^thnds of Eliciting ttroa-dlly Neutralising Antibodies 

O'J0> 15B9,01&PC£)l 

<140> 
<141> 

<:150> £0/115,404 
<151> 1999-01-OB 

<15D> to be assigned 
<L51> 2000-01-0? 

<1?0> Patent in Ve*. 2.1 

<Z10> 1 
<211> 5?? 

pkt 

<->3^> Jhunan immuciodef i-ix.i $J>^y —v-i nw -t-ypc -l 

<40Q> 1 

Ala fcrg Gin Leu leu Ser Gly lie Val Gin (Mn Gin Asn Asn Leu Leu 
1 5 10 15 

ajq Ala lie Glu A.13 Gin tin Kis Leu Leu Gin Leu Thr Val Trp Gly 
2Q 25 30 

lie Lys Gin Leu Gin Ala Axqf lie Leu Ala Val Glu Arg Tyr Leu Xyss 
35 40 

Aap Gin. Gin Lett Leu fcly He 
50- 55 



<21Q> £ 
<212> PRT 

<213> Artificial Sequence 
<22Q> 

<223> Description of -Artificial Sequence; 3 yptn&ti.c 
<d0&> 2 

ACT act L*u L*u Arg Als lie Glu Ala Gin Gin Bis Leu Leu Gin Leu 
1 S 10 15 

Thr Val 'J'rp Ely Ue LV R £3lri lj&u ^ Jrj r ^ e Aln Vai Gl13 

£0 25 30 

Axg Tyr Leu Lye Asp Gin 
35 



<210> ;i 
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<211> 36 

<212> l a fcT 

<213> Artificial Se^uonoc 

<22'~i> D&ftAf ip-tion of Artificial Sequence t Synthetic 

Sor Gly II* Vs. I- Gin GLn Gin Asn Asn le-u i-ew Arg Ala lie RJ.u Mot 
1 5 10 lii 

Gin Gin His leu Leu GJ.stj Leu Thr Val Trp Gly TJ.& Gin Leu Gin 

2D 25 30 

Ala Axg lie Leu 
3& 



<r£j 1 > $H 

<2l£> PET 

<2i:*> N ism tin i.«iftiui«def tcienry virus 

Trp Asn Met Thr Trp Met Gin Trp A«jp Ar^ Glu 21* 7i$*i J^ri Tvx 

3 5 10 15 

Thr Ssr T-e-U Zl« Hie Ser L-SU lie Glu Gl« Scr Gin Aen. Gin G.I 71 Glu 

so as 30 

Lys Asn Glu Gin Glu LEU Leu 61 u Leu Asp Lys Trp Ala Ser Leu T'rp 
35 45 

Asn Trp J?he Asn lie Thr Asn Trp 
50 55- 



<210> h 

<213> tivjn^n ijruFH>riCK3e f i fii^Yi r;y virus type 1 
<400> 5 

Tyr Thr Ser Leu lie His 3-er Leu lie Glu Glu Bex Gin Aan Gin GJ.n 
15 10 lb 

Glu Lys Aen Glu Gin Glu Leu Leu Glu Leu Asp Ly# Trp Ala Sex Leu 
£0 25 30 

Trp A A Ii Trp Phft 

3* 



<210> $ 
<211> 34 
<212> PM' 

<22U> 

<223> Description or Artificial Ssgu en uc : Synthetic 
<4DQ> 6 

Txp Wet Glu Trp Asp Arg Glu lie Asn Asn Tyx TJir Sex Leu lie Kis 
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1 5 10 15 

tier Leu lie Glu Glu S*r Gin Acn Gin Gin Glu bye Asn Glu Gin Glu 

20 25 30 



<210> "? 
<211> & 
<212> DHA 

<S13> Artificial Sequence 
<220> 

<223> Description of Arfcif icial Sequence : Synthetic 
<40Q> 7 



prt 

Hwn^n jn?iiMJjri ft efficiency virus type- 3. 

A.l* V&.l Gly T.V<r -Giv-Ala Lfiu-Phe L&u Gly F?ie Gly -ftln Ale Gly 

1 S lfl 15 

Sex Thr Met Gly A.ls i^g K&r MfcV. TLr Thr Va3 G]n Ala Arg Gin 
20 ?b 30 

Leu Leu Sex Gly lie Vsl Gin Gin Gin Asn Asn Lou Leu Arg Ala lie 
35 40 45 

Glu Ala Gin Gin Bis Leu Leu Gin Leu Thr Vsl Trp Gly lie lys Gin 
50 55 GO 

Leu Gin Ala Arg lie leu Ale val Glu Arg Tyr Leu Lye A£p Gin Gin 
6S " 70 7* 

l-eu Leu Gly 110 Trj> y Cys Ser Gly Lys Leu 7/\v Cys Thr Thr Ala 
£5 90 95 

Util Pro Trp Asn Ala SSr Trp S<sr Asn lys Ser Leu Glu Gin He trp 
100 105 110 

Asn Asn wet Thr l'rp Met Glu Trp Asp Arg Glu 11* Afin Afih Tyr Thr 
lt5 120 125 

Set Leu lie Hie Scr Leu J^e- Glu Gly Scr Gin Atn Gin Gin Glu Lys 
130 135 140 

Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser L&u Trp &£;n 
145 150 165 * 160 

Trp Fhe Ami llu Thr Asn Trp Leu Trp Tyr Lie Lys Il« Phe lie Met 
155 170 L75 

lie Vsl Gly Gly Leu Val Gly leu Arg .He Vsl Phft Als Val Leu J3* r 
380 190 
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II© Val A6n Arg Val Arq Gin Gly Tyr Ser l^io Leu Ser Phe Gin Thr 
155> " 200 20& 

Wis L«=*u Prd Thr **rO Ax-g Gly Px^o Asp Arg Pio £lu Gly rle- Glu Glu 
210 23J> 220 

G.ly *3ly £*ly Glu Arg Asp Arc? Asp Arq Ser lie Ar<i Lsu Val Ash Gly 
225 * " 230 ~ 235 2-1 0 

Ser Leu Ale* Leu He Trp Asp Asp Leu Are £&r -Cy 3 Pb* Sei% 

2*5 250 255 

Tyr Bi3 Arrp J..rtu Ar^ A.ep Leu Leu L&u Ha Val Thr Arg lie Val Glu 

£#0 26S 270 

Leu Leu Cly Ar^ Gly Trp Glu Ala Lay T.ys Tyr Trp Trp Asa *.eu 

275 230 235 

Leu Gin Tyr Ser Gin Glu Leu JLys Aaji Swi' Ala Val Ser Li?u 

290 ' 2§5 300 

Asn Ala Thr Al* IJ.<s Ala Va3 Ma GJu Gly Thr « r 'r^ VtfJ Ilr* Glu 

305 i10 315 320 

Val val tin Gly Ala Cys- Ar§ Ala lie Arg His lie Pro Ar-g Arq- ile 
325 330 335 

Arg Gin Gly Le-u Glu Arg lie Leu Leu 
3d0 3*5 



<21U> & 
<211* $5 
<212> PRT 

<213> BurRnn iTnniuft^R-flcienciy vi^fu* 
<4 00> 3 

Sex Glv lie Val Gin n G.I « Ar-n Asn Leu Leu Arg Ala 11© Glu Ala 
1 5 10 15 

Gin Gin His Leu Leu Sin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 25 30 

Ala Arg lie Leu Ala Val Glu Arq Tyr Leu Lye Aap Gin 
35 40 AS 



<21D> 10 

<2ll> 45 

<21Z> PRT 

2 1 3 > ttuman, irnmunodef jci ^rx:y virus 

<d00> 10 

Ser Gly lie Val Gin Gin G2n Abh Asn L*u Lev Arg Als He G.I u Ala 
3 i 10 15 

Gin Gin Kia Leu Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin 
20 2& 30 

Ala Arg Val Lou Ala Leu G2u Ars Tyr Leu Arc? Asp Gin 
35 4 0 4b 
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<210> 11 
<211> 3G 
<212> PRT 

<213> Human immunodeficiency varus. 
<40tt> 11 



GJy Lie val Gin Gin Gin as» Asn Leu Leu Arg Ala 31* Clu Ma 



Gin 



Gin Kis Lov l«i> «n Thr Val Trp <Hy He W Bin Leu Gin 



20 2* 30 



Ala Axa Val Leu 

35 



<210> 12 
<211> 3B 
<2i2> 

<L>13> Human immunodeficiency virus 

iJn D iJS 2 be B L*u Arg Ala TJ* GId Ala Gin Gin Kis Leu Leu Gin Leu 
. n 5 10 i- 

Thr V*l Trj> «y He Lys Clu Low Gin M* Are Val Leu Ala Leu Glu 

25 



Arg Tyr Lev Arg Aep Gin 

35 



<210> 13 
<211> 4 5 
<212> PRU' 

<213> Human innraunodef ini«ty virua 

^Gi^Il* val (iln Gin Gin ^ ^ Ala iLe Xia 

' " a ^ 5 3D i 5 

Gix. Gin Arg Wet Gin Thr Val Trp Cly He Ly* Gin Gin 

Ala Arq Val Leu Ala V*3 Glu Axg Tyr Leu Gly Aap Gin 
" 35 *0 ^5 



<21D> Id 
<2ll> 36 
<2L2> PRT 

<21-i> Human immun&dfcficiency virus 

£er°Gl^Ile Val Cln GJ* C3lft Aan Asn Leu Leu Arg Ala lie Glu Ala 

Gin Gin Arg Hat Leu Gl* Leu Thr Val Trp Gly He Lys Girt Leo Gin 
20 2S 30 

Ala Arq Val Leu 
35 
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<2io> is 

<211> 3fi 
<400> 15 

ftsn Asn lex\ Lc^ Arg 7M v T2r. Glu TUa Cln Cln Arq Met Ley Cln Leu 
1 5 10 15 

Thr Vnl Trp Cly lie Lv-v Gin L^u Gin Ala Arg Val Lou Ala Val Glu 
20 25 30 

Arg l'yr Leu Gly Asp Gin 

35 



<2X0> 16 
<21L> d5 
<212> PRT 

J3-e^ aiy 11k VaJ GTti Girt i?ln Asn Asji Leu Leu Arg Ala II© Glu Ala 
.1 5> 10 15 

Gin Gin Kis Wet Leu GLn Leu Thr Val Trp Gly lie Lye Gin Leu G1j> 
20 25 30 

Ala Arq Val Lets Ala Leu Glu Axg Tyz Lea Aj?y A*p Gin 
35 JO 4f> 



17 

<211> 36 
<212> VKV 

<213> Hiur&an liranunodef iciency vlru* 
<400> 11 

Ser Gly lie Val Gin Kin Gin Aftn A*n Leu Leu Arg Ala Il<? GLu Ala 
lb 10 15 

<SJn 6.1^1 Kis >tat L*u Gin Leu Thx Val S'rp Gly lit bys Gin Leu Girt 
20 2G 30 

Ala Arcs Val Leo 
35 



<210> IS 
<211> 36 
<Al2> PRT 

<#3 3> Humoin immunodeficiency virus 
<*0D> IP 

Asn Asn Leu Leu Arq Ala lie Glu TU.n Gin Gin His Met Leu Gin Leii 
1.5 10 15 

Thx Val Trp Gly J Aw Ly« Gin Leu Gin Ala Are va.1 leu Ala L^u Glv 
20 25 30 

Arg Tyx Leu Arg Aep Gin 
35 
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<210> 13 
<z?.ys.* PRT 

±2Y-$> Human ijiCTurocJeti^jHtft^y virus 

slr^li^llfe val Gin Gin Gin Rer An Leu L*u Arg Ala II* Glu Ala 

Gin Gin His Met Leu Gin Leu Tbr VjsJ. T*P Gly lift lya Gin Leu Gin 

20 ^5 30 

Ala Arg VcO I,i?u Ala lie Glu Ar<j Tyr Lea Arg Aaj> Gin 
35 40 ^ 



<2lD> 20 
s2ll> 36 
Q^12> FRT 

<ai3> Human imraMrKKSftiiciAncy virus 

w Glv He tfal Cln Gin GM fl« Asn Leu Leu Axg Ala lie Glu nls 
'l 5 " 15 

Gin G)xi His Met Leu Gin Lev T*r Val Trp Gly 11* Lya Gin leu Gin 
20 2& iC? 



Ala Ar^ VaI Leu 
3* 



<21to> 21 
<211> 3B 
<212> PKT 

<?.13> Human immwittdixUcieiifcy virus 
-;d&0> 21 



S«r Asn Leu Leu Arg A.l* Il« Glu Ala G.ln Gin His Wet Leu Gin Leu 
1 b 30 IB 

Thr VAl Trp Gly lie Ly& Gin lev GJji ftlu Are Vaj Leu Ala lie Glu 

20 2& 30 

Arg Tyr A»g Asp £ln 

3£> 



<21D> 22 
<21I> « 
c212> PRT 

<?.13> Human lmcnanodtef iciency virus 

Ser°Glv ? Il^ Val GJH Gin Gin Asn Asn Leu Leu Arg ALb lie Glu Ala 

Gin Girt His ifcU 'L^u Gin Leu Thx Val Trp Gly lie Lys Gin Luu Gin 

20 25 3D: 

Al* Arq val J^u Ala Val Glu Ser Tyr Leu LyS Asp Gin 
35 40 45 
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<210> 2% 
<211> 
<212> PT^T 

<213> Mum^n iiitmunadef icisncy' virus 

<40U> 23 , 

Asn Asn Leu Leu Ar<? Als lie Glu Ala Gin Gin "is Lftu Lsu Gin L$u 
1 5 10 IS 

Thr Val Trp Gly II© T..y* Gin Leu Gin Ala Arg Vsl Lrf?u Ala Val Glu 
20 25 30 

Ser Tyr LfeO Lys A£p Gin 



<210?> 24 
<211> 45 
<:212> PRT 

<213> Human .i Jirniunftrtef iciency virus 
<400> 24 

Ser Gly 11© Val Gin Girt Gin £er Asn Leu Leu Arg Ala lie Glu Ala 
L 5 10 15 

Gin Gin His Leu Lou. filn Leu Thr Val Trp Gly II** Lys Cln Le-u Gin 
20 25 30 

Ala Arc Val Leu Ala Val Glu Axg Tyr Leu Gin Aop Gin 
35 40 45 



<210> 25 
<211> 26 
<21?.> PRT 

<213> H^m^n iiwnunodef iciency virus 
<4 00> 25 

Ser Gly 11© Val Gin GJri C^in S&r Asn Leu L&u Arg Ala lie Glu Ala 
15 ID 15 

Gin Gin Wis Leu Leu Gin Lgu Thr VaJ Tr:p Gly lie Lya GLn Leu Gin 

20 25 30 

Al* ArQ: VmI Lien 



<£1U> 2* 
<212> PRT 

<213> tfuroan immunodDf j.c.iftnc:y virus 
<A00> 26 

flfir Asn L&o L&u Arg Ala lie Glu Ala Gin Gin Hi* T.eu l.ftu GJn Lou 
1 5 ID 15 

Thr Val Trp Gly lie Lvs Glu LGu Cln Al* Arg Val Leu Ala Val Glu 

20 2b- 30 

Arg Tyr Leu Gin Asp Gin 
35 
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<:210> 2"? 
<2U> 45 
£212> l*RT 

•■;213> Human immunodeficiency virus 

<*orj> 2? _ ^ hl 

Sr-r ply II* Val Gin 0I» Rlr> B*r A*n J,mi L«u Axg Ala II© Glu Ala 
'] 5 10 1& 

G3n Sin His Leu Leu Gin Luu Thr Vsl Trp GJy He Lys Gin Leu Gin 
20- 25 3D 

Ala Arc? Val leu Ala Leu. Glu Arg Tyx Leu Arcj ASt? Gin 
35 40 * b 



<2L0> 29 
^2L1> 3B 
<212> PRT 

<c213> JJuman iramonoeiftficiBncy virus 

<4QD? 2B " , , 

iier Asn 1-eu Leu Axcr Ala Glu Ala Girt Gin His Leu Leu Gin Leu 

1 5 ]r) 15 

Ti-ir Val Txp Gly lie Ly* Gin T*w G.ln A.la Tin? Vai Leu Als Leu Glu 

Aj?q Tyr Leu flrg Aap Gin 
35 



<21 3? Human immunodeficiency virus 
<*00> 29 

S-er Gly II r- Vol GIs-j Gin Gin Ser Aan Leu Leu Arg Ala lie Cln Ala 
1 S 10 15 

Gin Gin His M*t Leu Gin Leu Th* Vrti T rp G.1 y Val Lys Gin Leu Gin 
2D ' 35 S0 

Ala Arg vai l&u Al3 Val Glu Ax^ Tyx Leu Lys CUft 
3S -90 4 5 



<21Q> "30 

<211> 36 

<212> POT 

<213> HLiinar. ammajKxSef J.cienc.V virus 

<JQ0> 3D 

q cr m v 31 r vaj Gin Gin Gin £*r Asn Lieu Leu Ajg Ala lie Gin Ala 
1 5 10 13 

Gin Gin Jiia K*t Leu Gift Ltu TJir VyJ. Trp Gly Val Ly£ Gin Leu 61 a 
20 25 ^0 

Ala Arg vsl l.*n 
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<210> 31 
<211> 33 
<212> fbt 

<2 1 ??> human iriijimi"3C<iit= f i ci en cy virus 

r>ftj* R^r*. J*jv Luu Aia 13 ft Gift Ala Gift C31*> Hie M*t. L^u Gin Leu 

1 5 HO 1& . 

Thr Val Trp Gly Val Lys Cln Leu Gin ALo Axg Vw.1 Le-u Als Val Glu 
2D 25 30 

Arg Tyr L*u Lys Asp Gin 
35 



<210> 32 
<21L> 45 

<:213> Human immunodeficiency virus 
<400> 32 

Ser Gly lie val Gin Gin Gin sex Asn Leu L*u Lya Ala. lie Glu Ale 
16 30 15 

eian GJn His Leu L*ju I-ys I^p Thr Val Trp Gly Us Lys Gin Leu Glr* 
20 25 30 

A3 a Arc? Val Leu Ala Val Clu Arg Tyr Leu Lys Asp -Sin 
35 40 45 



<210> 33 
<211> 36 
<212> FRT 

<213> Human immunodeficiency virus 
<*0G> 33 

Ser Gly ll-e Val Gin Cln Gin 3-er Asn Leu Lya Ala J.I £ Glu Ala 

15 10 15 

Gin Gin Hi* J.isu L£\j Lys Lfc« Thx- Va.l Trp Gly 73c Lys Gin Leu Gin 
20 25- * 30 

Alp Arg V^l L«u 
35- 



<210> 34 
<211> 3S 
<212> PKT 

<213> HuiaaiV iMiiuftorhgtfici^ncy v j .tu-p 

Ser Asn Leu Leu LyB Ala lie Glu Ala Gin fi]n Hi a "Ley Lou Lya Leu 
1 5 10 15 

Thx Val Trp Gly Tie Lys Gin Leu £ln Ala Arq val Leu Ala val Glu 
20 25 30 

Arg Tyr Leu Lys Asp Gin 

35 
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<210> 35 
*211> -45 
<212> PRT 

<213> Kuman i mniimodef iciency virus 

<400> 3D ^ 

Ber Glv ll& Val Cln Gin G.Uj TiAii J.U*! ].&u Lau Axg Ala lie Glu Ala 

J. 5 '10 13 

Gin 61" His Leu leu Gin L*u Ser II© Trp Gly Ilu LyS Gin T-*" 51 ri 
20 25 30 

Ala Lys Vnl Lct fiXs lie Glu Arg Tyx Leu. Axg Aap Gin 
35 -J 0 i5 



<2L0> 36 
<212> PRT 

<213> Human irnmunoderi ftienrry virus 
o*0D> 36 

Ser Gl" 11* Val Gin 6.1 n 5 1 an A*n T.lO Lew Leu Arg Ala lie GLu A.l.« 
1 J 5 10 IS 

Gin Gin His Leu Leu GU> Lei? Sor lie Trp Gly I).* Lys Glri Iftu G3.r> 
20 25 30 

Ala lys val Leu 

35 



<2LD> 37 
<211> 3B 
<212> PRT 

<213> Human immwicriciicidJicy virus 
<*D0> 37 

Aen He Leu. Leu Arq Ala II* Glu Ala Gin Gin ttis Lou L^u Gift Leu. 
1 & 10 15 

S^r T.I. a Trp Gly lie Lys Gin leu Gin Ala Lye Val Leu Ala lie Glu 
2-D " 25 30 

Arg Tyr Le-u Arc? Asp -Sin 
35 



<21U> 3S 

<211> 45 

<212> PRT 

<213> human imimnodBiiuisnuy virus 

<dt>0> 3G 

Lve GJ V lie Va.1 Gin C51n Gin Asp Asn Leu Leu Are Ala lie Gin Ale 
L & 10 IS 

Kin Gin Gin Leu Leu Arg Lbu Sex Xaa Trp Gly lie Arg Gin Leu Axg 
M 2b 30 

Ala Arg Leu Leu Ala Leu Glu Thr leu Leu Gin Asn Gin 
35 i0 45 
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<2i0> 
oiii> :*5 

Hvman immunodeficiency virus 
<4DO> 39 

Lys- Cly I no Val Gin Gin Gin Asp Ago leu Leu firr^ A3* 11© Gin Ala 
15 10 15 

Gin Gin Gin 3Um L*U Arg leu Sex Xaa Trp Gly lie Arg Gin Leu Axg 
20 25 30 

, Ala Arg Leu 
35 



<2ltt> 40 
<S1^> PPT 

<21'S> tfumsm iiftmunodlef iciency virus 
<■! 0O> 40 

Asp fi*n i.«u Leu Arq Ala lie Gin Ala Gin Gin Gin Leu Leu Axg Lfcu 
3 5 10 ID- 

Sox X<*# Trf> 63 y llfr Arg Gin Leu Axg Ala Arg Leu Leu Ala leu Glu 
20 " 25 30 

Thr leu Leu Gin Aisn Gin 
35 



<21U> 41 
<211> 46 
<212> PRT 

<213> Human immunodeficiency virus 
<d00> 41 

Txp Met Glu Trp Jlsp Axg Glu lie Aan Asn Tyr Thx Ser leu lie His 
1 5 10 15 

Sftrf leu Tie G-lu Glu Sex Gin Asn Gin Gin Glu lye Ann GLu Gin Glu 
20 2£ 30 

Lou Lt?u Glx\ Leu A tip Lyft A.l£ 5er T.*u Tr1> A fin Trp PJks 

35 4b 



<210> 4 2 
<211> dfe- 
<212> P-RT 

<213> Hunan immunodeficiency virus 
<A0Q> $Z 

Trp Mttt Glu Trp Glu Arg Glu ric Glu Asn Tyr Thr Cly Leu lie Tyx 
1 5 10 1!> 

7*hr Leu lie Glu Glu Der Gin Asn Gin Glr> Glu Ly3 Asn Glu Gin Asp 
20 25 30 

Leu Leu Ala Leu Aap Lys Trp Ala Ser Leu Trp Asn Trp Phe 
3& AO 45 
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<21Q> A3 
<211> 34 
<212> PR'J' 

<2l3> Human immunoctef i c;.icrn:y virus 
< A r>o> 4 ^ 

Trp Mnl" Ri« Trp Glu Ars Glu lie Glu ton Tyr Thr Gly Leu lie Tyr 
1 5 3 » 13 

Thr L*u lis Glu ser Gin Asn Gin Gin GJ* Lya Gln 
£0 2b 30 



Leu Leu 



<210> 4 4 
■C211> 36 
<212> l v RT 

<2L3> human ironurotfef ieienry virus 

ryr°rhr^ly Leu He Tyr Thr Leu Ufi Glu Glu Ser Gin Asn Gin 

GUJ M& Aen Glu tiln Asp Leu leu Ala ^ *$P L V S Tr P * la 3ex 
20 £5 50 

Trp ft-sn Trp Phe 
3h 



<210> 4 5 
<?11> 4 6- 
<2L2> PRT 

<213> Busnan imntunortcl-ltifrney virus 

l4p D Wet 5 Glu Trp Glu Are Glu lift Asp Aen Tyr Thr Ser Clw Lie Tyr 

Thr Leu lis Glu Glu Ser G3.n ^ft Gin Gin Glu Lys Asn Glu Cln Glu 
2D 2* 30 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Lev Trp Aan Trp *he 
35 4 0 4£ 



<210> 4& 
<211> 34 
<2L2> PRT . 

<213> Buman immunodeficiency virus 




Thr Leu Hp Glu Glu iter Gin Asn Gin Gin Glu T.ys *ftn Glu Gin Glu 
20 2* 311 



Leu Leu 
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<511> 3« 
<212> PRT 

<213> Human inrniunOcSof icl-ei^y virus 
«500> d7 

Tyi Thi Set Glu lie Tvr T'hr leu lie Glu Glu 5er Gin Asn Gin Gin 
1 5 30 .3-5 

■Slu J.yA Asm Glu Gin Glu Leu Leu Glu Leu Asp Lya Tip Ala brer Leu 
20 25 30 

T.cp J^ri* Tip Phe 

<210> 4 H 
<211> 46 
<212> PRT 

<213> Eiuman immumodcf lcJ.*r>cv v i 

Tip Met Glu Tip Clu Ar<] Glu I2r^ Asp Asn Tyr Thr Asp Tyr lie Tyr 
15 10 15 

Asp leu leu Glu Lvs Scr Gin Thr Gin Gin Glu lys Asn Glu Lys Glu 
20 " 25 30 

Leu Leu GLu Leu Asp Lys Trp Ala Sei Leu Tip Asn Trp Phe 
35 40 *5 



<210> d£ 
<2L1> 3^ 
<21Si> PRT 

<2J 3> Human ijniau^Cf^riciftrif/y vims 

Tip Met Glu Tip Glu At<i Glu lit? Asp Asn Tyr Thr Asp Tyr lie Tyr 
lb 10 15 

Asp Leu Leu Glu Lys Scr Gin Thr Gin Gin Glu Lye Asn Glu Lye Bin 
20 25 30 

Leu Leu 



<210> 50 

<211> 26 

<212> PRT 

<2 ]. 3 > Huroa n ' innnml Ode f X C J enc V v i rus 

<400> £0 

Tyr Thr ftsp Tyi lie Tyr Aap -b&u Leu Glu i.yp s*.r G.l ft Thr GJ.j> Gin 
"l !a 10 L5 

Glu Lys Asn Glu Lys Glu Leu Leu Glu Leu Aap Lye Tip Ala Uar Leu 
20 25 30 

Trp Asn Trp Flie? 
35 
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<210> bl 
<211> ifc 
<2l2> PRT 

<£:3> Human irflmunodfrf icier-cy virus 
<AQQ> 51 



Trp lie Gl» Trp Aa p Are Glu He Sex Asn Tyr Thr Gly He lie Tyr 
1 5 10 

Jra Leu Leu GLu Glu Spx- GJ.n Asn GJr> 61D hys Asp 

20 25 30 

l*u Leu Ala Leu Asp Lys Trp Gin Asn Leu Trp 3*r Trp Phe 

40 



<21Q> 
<213> TOT 

<213> flwni^n .i m»un ode £ icier rsey virus 

Trp°ile^Gln Trp Asp h^q RJ«; Sis* A.™ Tyr Tin- <?3y lie He Tyr 

ten Leu Leu Glu Glu Sor ^ln Am Gin Gin Glu A*n Asn Glu Lys Ae P 

20 25 30 

Leu Leu 



<:210> 53 
£21'i> 36 
<212> m 

<213> Human immunodeficiency virus 

Tyr°Thr^ly 11c 11c Tyr Ar<i Lev Lw Glu Glu GJn Asu Gin Gin 



Glu Asn Aan Glu Lvs Asp Leu leu Ala Leu Aap Lys Trp Gin Asn 
20 - 2D 30 

rrp £er Trp Phe 
35 



<2J0> 5d 
*2H> 4fc 

<?ja> prt 

<213> Buman iramunodef i ciency virus 
<400> 54 

Trp Met Glu Trp Glu Arg Glu He Ser Asn Ty* Thr Gly Leu I1& Tyr 
1 & 50 15 

Asp Lsu lie Glu Glu Ser Gin lie Gin Gin Glu lys Asn Glu Lya Asp 

20 2& M 

Leu leu Glu Leu Asp LyS Trp Ala Eer Leu Txp fan Trp Phe 
35 -10 ^ 
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<2l3> Hum aii imriujncjd^f ici&ncy virus 
<40Q> 55 

Trp Met Glu Trp Glu Arcj G.lu lie Scr Asn Tyx Thr Gly Leu lie Tyr 
15 10 15 

Asp 'Leu He &lu Glu Ser Cln lit Gin Gin Glu Lys Asn Glu Lys 74i?p 
20 2D 50 

LfrU lifcU 



<2lD> bf- 
<2il> 

<212> JJluaia*i isnmunode f iclency viniK 

Tyr Thr Gly L£i* 1.1* Tyr Asp Leu. He Clu Glu Ser Gin lie Gin G.l.n 
1 5 10 1F> 

Glu Lys Asn Glu Lys Asp Leu Leu Glu Leo A»p Lye Trp Al* f?e.r L*u 
20 2D :io 

*Trp Asn l"rp Phe 

as 



<210> 5? 

<211> *6 

<212> PRT 

<212> fluman immunodeficiency virD.*: 

<400> 5? 

Trp Mel: GJ u Trp film Lys Glu i;* 5cr Asn Tyr Ser Asn Glu tfal Tyr 
1 5 10 15 

Arq He Glu Lys Gin Asn Gin Oln Glu bye Asn Glu Gin Glv 

SO 25 3fJ 

Leu Leu Ala Leu Asp Lys Trp Ala Ser l^U Trp J4*n Trp Phe 
35 40 45 



<210> DS 
<211> 34 
<2l2> PRT 

<£l :\> Bwntan* irranunodef .iciency virus 

Trp frfe-t 151 u Trp Gin I.ya Glu lift Bor Asn Tyr Ser Asn Glu Val Tyr 
IS 10 15 

Arq LCu He Glia Lys Scr Gin Asn Gin filr> Glu Lys Asn -31 u Gin Gly 
20 £5 3D 

Leu LifiU 
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<ZlO> 
<211> :i6 
<212> PRT 

<21$> HurtSrt immunodeficiency virus 
<>300:> 



Tyx 2er Asn Val 3'yr Arg leu JJ<s Glu Lys £er Gin Asn Gin Gin 
1 J» IP 15 

<5ly Lye Asn Glu Glr> Gly Leu Leu Ms Leu Asp l-ys Trp Ala Ser lieu 
20 25 3D 

Trp flr.u Trp Phe 
3& 



<210> en 

<21L> 4 6 

<212> ?FlT 

<213> Human immunodeficiency ^.inis? 

<400> 615 

Txp n* Gin Trp Asp Arrj Glu I J ft fle-r Asn Tyr Thr Gin Gin He Tyx 
1 5 10 15 

Ser He <5lii Glu Sex Gin Acn Gin GJn Glu Lya Aan Glu Gin Asp 
£0 25 30 



Leu Leu Ala Leu Asp Asn 'J'rp Ala Sex Lau Trp TTix- Trp Fhe 
35 40 4& 



<g*D> 61 
<2)1> 34 

<213> Human immunodeficiency virus 

<400> 6? . 

Trp He Gin Trp Asp >irg Glu II© Scr fcsn Tyr Thr Gin Girt He Tyr 

1 5 10 15 

Ser Leu lis Glu Cfiy Ser Gin Asn Gin Gin Glu Lys Art a Glu Asp 
20 25 30 

Leu Leu 



<210> 62 
<233> 36 
<212> PUT . 

<213> HumnO immunodeficiency virus 
<400> 62 

Tyr Thr Gin Gin Lie Tyr Sfcr Leu JJe G3LM Glu i3er Gin Asn Gin Gin 
] 5 10 15 

Glu Lys Asn Gtu Glu fcsp Leu Leu Ala Leu Asp Asn Trp Ala Scr Leu. 
20 25 30 



Txp Thr Trp Phc 

35 
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o*ll> 4ft 

<£23:> Hvni^Ji Immunodef ici-ency vi rw: 

Trp Wet GJu Trp Asp Aig Gin lie Asp Asn Tyx Thr Glu. V»3 Tie Pyx 
L D 10 15 

Arq Leu Lvv Rlu IjftU Ser Gin Thr Cln Gin Glu Gin Asn Clu Gin Asp 

20 25 ' . 30- 

Led L-bu Ala leu ftsp Lys Trp Asp fier Leu Trp Asn Trp Pbe 
35 dQ 45 



<2l0> 

3> PRT 

<5I3> Human irramjnodef iciency virus 
<*-a D p > ft * 

Tirp Mnt (3.1. ii Tjrp Asp Arg Gin lie- Aep Abu Tyr Thr Glu Vai 11© Tyr 
1 ID 15 

Arc? Inij Lw Glv Leu Sear GLn l k hr Gin Gin Glu Gin A an Glu Gin Asp 
SO 25 30 

L*CU: LOU 



<210> 65 
<211> 36 
<212> PB.T 

<213> Jiuman iiniBuncKfcfj Cheney vj f g$ 

Tyr Thr Glu v«al lie Tyr Arq Lei: Leu Glu Leu Ser Gin Thr Gin Gin 
"l 5 10 15 

Glu Gin Asr.n fili> Gin Aap Leu Leu Ala L*u A»p Ly£ Trp Asp £?e/ Leu 
20 2S 30 

Trp Asn Trp Phe 

35 



<23fl> 66 

<2ll> J? 7 

<212> PRT • 

<213> Huanan Aftrnt>ri<KSfeI'icleni-y iriru-s 

<400> £6 

Txp lie Gin Trp Glu Arq Glu lie Ai>u Asn Tyr Thr Gly lie He Tyr 
2 5 10 15 

Ser Lfru Lie Glu Glu Ala Gin Asn Gin Gin Glu Asri Aen Glu Lys Asp 
2D 2* *° 

lieu Leu Ala Leu Asp Lvs Txp Thr Asn Leu Trp Asn Txp Pho Asn 
35 - *U *5 
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<210> 67 

<211> 34 

<2'12> PICT 

<213> liuman iiranvmodef i 0 i ency virus 

<4W)> tp"F 

Trpa Ho esi-xi Trp Glu Arg Clu J.le Aaa asyj Tyr Thr Gly lie He Tyr 
] & 1A 16 

Ser leu Ho Glu Glu Ma Gin Ago Gin Gin G.lo Asn ftsn Glu i,ye Asp 
50 25- 30 

Leu Leu 



<210> 63 
<2H> 3"? 
<212L> FRT 

<213> Human immunodE-f i cier*cy virus 

Tyr TU^ fily lie He Tyr Scr T-ru Lift Gl" Glu Ala Gin Asn kin Gin 
1 ' 5 HQ 15 

Rlv A *n filu Lys Asp Leu Lev Ala Jaju ft'jp Lys Trp T&x Aen Leu 
20 25 30 

Tjrfi Ajsn Txp Pbfi AjSIi 

35 



<Z10> 69 
<21l> AS 
<212> PRt 

<:213> Human iirammocteiiciency virus 
<dDO> 69 

Trp Gin Gin Trp Asp Lys Val .tec? Aan Tyr Ser Gly Val II* Fhe 

Gly Leu lift Glu Gin Ala Gin Glu Gl.il GJ n Asn Thr Asn Glu Lys Ser 
20 25 30 

Le-g J-ev Glu Lftu Asp Gin Trp Asp Ser Lou Trp Ser Trp Ph* 
35 40 



<210> 70 
<2ll> 34 
<212> PKV 

<213> Human iiranunoder ioi Gu&y virus 
<4Q<5> 10 

Trp Gin Gin Trp to Glu lys Val Arg AHn l'yr Ser Gly Val He Phe 
15 ID 1& 

Gly Leu lie Glu Gin Ala Gin Glu Gin Gin ftsn Thr Asn Glut Ly£ Ser 
20 25 30 

Leu L u 
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<212> PKT 

<2l:\>> MuTiian imiTLunodef onfry ^.i rvs 

Tyr f5or fily Val He l J he Gly Leu L?.e S.lu Gin Ala RLn Glu B.I n fi I n 
I & 10 "15 

Aan Thr ft.sr> Glu ].ye Ser l^eu Leu Glu Leu A3[> GI51 Trp Asp Sox Lou 
SO 2S 30 

Trp Sex- Txp Ph>* 
3b 



-121 1 > «; 
-t212> PRT 

<213:=- Hi'm^r, i nui:urbOdrt fieri ertcy virus 
<4 00> 7?. 

Txp Gin Glu Trf rt*3f> ^'9 Gin. Tie- 3&r Asn xle £ei Ser Thr lie Tyx 
lb 10 15 

Glu Glu ILe Gin Lyn Ala filn Wal fi Ln ftln Glu Gin Asn Glu IbyA JjVA 
20 3f> 30 

Leu L-eu G2ti Lieu Asp Glu Txp Alii 5*?r He Trp ftswi Trp Leu 



<210> 73 
<212> FRT 

<2~±2>> Human immursodef ici^"c?y vfiruN 
<dD0> 73 

Trp. Gin Glu Txp Asp- Arq Gin lie Sex Agji lie Scr 5er Thr Tie Tyr 
1 & 10 15 

Gtu Glu He- fifrs Xye Ala Gin Val Gin Gin Glu Gin Asn Glu i_.ys Lys 
S.0 25 so 

Lev L*=u 



<2ll> 3 0 
<212:* PRT 

<213> Ham^n immunodef icitsn^y virus 
<ijC-0> 7 \i 

ser Tjjr Tie Tyr Glu Glu lie Gin l*ya Ala Gin Veil Gin filr> 

l 5 1Q 

Glu Gin A-sn 13 Lu Lys Lys Lou Leu Glu L*u. Asp Glu Trp Ala Sex ILe 
20 25 3D 

Txp A-^ri Trp Leu 

35 
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<210> 75 
<211> -165 
<212> DNA 

<213> Artificial Sequr;^:* 
<22i> ens 

<222> (3-?) , . 135?) 
<22u> 

<223> Description Of Artificial 5*QU*nc*: Synthetic 
<4GO> 75 

ageggtgege cgaaaqta&g cgotsiagctt cat at 5 qgl ai.L ytt cag cag cag 54 

H«t Gly Jlc? V^O Gin Gin Gin 
1 5 

aac flat ttg ctg agg get att gag gcg caa cag cac ctg ctg cag ctg 102 
Asn Artrt Leu Arg Ala lie Glu Ala Gin Gin His Leu L&u Gin Leu 

ID 15 20 

qta tgg ggc ate aag cag ctg cag gca cgc ate ctg get gtt gas 150 
Thr VaI Trp Gly lie Lys Gin Lbu Gin Ala Arg He Leu Ala Vsl Glu 
fiS 30 35 

cgc tac ctri *ag gat czz ggc ggc ggc tea gege gec ggfc tea l<$$ 
Arq Tyr Leu Lvts Asp- Gin Gly Gly Gly '31 y Ala Gly S&r Glu Trp 

46 ' 4iS 50 55 

gac aga gaa att ^^r- set La£ ar.a 3G« tt? cats ca^ Uf:r_ M-.w ott gaa 24$ 
Asp Arq Glu 11c Asn Aftfl Tytf Thr Sr?r L^u lie "is L<sv 11^ Glu 

60 55 70 

gaa teg caa asc cag can. gaa sag aat gaa caa gaa tta ttg gaa tts 29$ 
Glu iter Gin Asn Gin Gin Glu Lys Asn Glu £ln Glut Leu Leu Glu Leu 
75 BO 35 

qat. aaa tgg gca agt ttg tgg aat tgg ttt gad ttc ate gat gat ate: 342 
i$p Lys Trp Ala S«r Leu Trp Aan r J'rp Ph& Glu Phe lit Asp Asp He 
3fl &£ 100 

aga tec ggc tyo t£<? paaflGCor.SA atfKji.toagctg agtttggctg ctgccacccg 397 
Arq Ser Gly Cys 
105 

ctgsgcaata actagcatsa ccccttgggg gcctctaaac gggtettgag gggtttlltq 45? 
C.ttgaaag 435 



<?.lt» 76 
<2>n> ]0V 
<2LK> wr 

<:213> ArtificiaJ Sequ^ftce 

<223> Description of Artificial Sequence: synthetic 
<*D0> 76 

Met SJy i-le val Gin Gla 5.1 n Asn ftsn Lou Leu Arg Ala lie G2u Ala 

15 10 15 

Gin Gin His leu Leu Gin Leu Thr Vai Trp Gly lie Lye Gin Gin 

20 25 30 

Ala Arg lie Leu Alo Vsl GLu Arg Tyr Leu Lys Asp Gin Gly Gly Gly 
35 40 45 
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Sdt Cly Aln ^7;.^v ^ r Trp Asp Arg Glu lie Asu Asn Tyjr- TJn- S#t- 

50 I>£ €-0 

Lei« He Kis Sex T^u He- Glu Glu ESex Girt As ii Cln Gin Glis Lyy Ar-n 
65 j'O 80 

Glu Gin Glu Leu Leu G) 0 J.tn> Asp ).y.s Trp Al* S*r T^y TrfJ ftzn Trp 

85 9C1 95 

l^he £lu l J he He Asp A*}> Arg Sr?r Gly Cys 

100 105 



<210> 7? 
<211> 1S7 
<2:«2> DMft 

<£j.3> Art:i filial Sequence 
*2SP> 

<223> DcBt?r*.pt£a" of Artificial Gequ-s/ic* : Synf.be 1:1 <: 
<40Q> 7? 

gngggactat atcoy^ttost tC£ a Cf £«C i^qot^tgggc gucAtgatug cgtagtcgat 60 
acrtggct cc-a agtsaccgnz^ $ g<: ij h c*? ij c;<; yctqt qcc^gg qcgc^astagg cggtcga&ag 120 
tgctttctsg aaccgggtgft <j ? <■ *?-a t qcaicacgcc tatagcgcta gagccgctgc LSO 
attaaatgas tcggcca 197 

<210> 7B 
<21L> IS 

<2"12> FRT 

<213> Artificial Sequence 
<220> 

<223> Description nl* Artiiicrial 5ft^u*r>^: S>na1:be1:i<; 
<4lJf» 7I3 

Ty* Th/' B«sr Lau I.J u Hifc Bfrj* I-fcu Jlw Glu Glu ^ttr 13.1 n 3ten Gin Gin 

i -a jci 15 

Glu L^e 



<r210> 79 
<211> IB 
<212> VKT 

<21 3> Arti£icifl£ 5*^i?encrft 

< 2 2$ > Dciicript i on cf Art i f i c i a 1 Sequen ca : Syn t het ic 
<400> 79 

Glu Glu Ser Gin Asn Gin Gin Glu Lya Asn Glu tin Glu T.fcu Glu 

1 5 10 15 

Leu Aap 



<210> BD 

<211> 13 

<212> PRT 

<2t3> Artificial 
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-23- 

<223> Dtssuripli of Ari,iiic.i &\ 5<w$nr;c; Synthetic 
<40O> 80 

Leu Glu Leu Asp Lvs Trp Ala Ser Leu Txp Asn Trp Phe 
1 & 10 



<2\ti> Hi 

<213> ftxliXioial 5equ fence 
<220> 

<2S3> Dsacrj pt i ^ft of ftft.if i cial Sequence: Primer 
<40n>. 81 



<210> 82 
<211> 44 
<212> DNft 

<213> Artificial Sequence 

<220> 

<223> Eeficriptlon of Aztif iniai Sequence: Frjjircsr 



<d00> £5 

e^geogg^r. titgagt^ijr; QccUystoc ttc^ggtagc csttc U 

<%-}Q> 83 
<211> 4<5 
<2J.3> DNA 

<213> Art.if?.cifll Sequence 
<22G> 

<2 2 3 Description of Artificial Se^uenc^; Pointer 

gqgccqgcgc eggctcagag tgggacaga^ aaatlaacaa ttoc 44 
<210> £0 

<213> Artificial Sequence 
<220> 

<223> rescript ion of ftrtifici.$J Syqusncs: Primer 
<40D> $4 

gggecgaatt cttaaaacca attccacaas cttgcfctatt t dl 
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